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FUNGAL TRANSCRIPTIONAL ACTIVATOR USEFUL IN METHODS 
FOR PRODUCING POLYPEPTIDES 

Background of the Invention 

5 

Field of the Invention 

The present invention relates to isolated nucleic acid 
sequences encoding polypeptides having transcriptional 
activation activity and to the polypeptides. The invention also 

10 relates to nucleic acid constructs, vectors and host cells 
comprising the nucleic acid sequences. The invention further 
relates to host cells useful for the production of polypeptides 
in which the production or function of the transcriptional 
activator has been altered, as well as to methods for producing 

15 the polypeptides. 

Description of the Related Art 

The use of recombinant host cells in the expression of 
heterologous proteins has in recent years greatly simplified the 

20 production of large quantities of commercially valuable proteins 
which otherwise are obtainable only by purification from their 
native sources. Currently, there is a varied selection of 
expression systems from which to choose for the production of 
any given protein, including eubacterial and eukaryotic hosts. 

25 The selection of an appropriate expression system often depends 
not only on the ability of the host cell to produce adequate 
yields of the protein in an active state, - but, to a large 
extent, may also be governed by the intended end use of the 
protein. 

30 One problem frequently encountered is the high level of 
proteolytic enzymes produced by a given host cell or present in 
the culture medium. One suggestion has been to provide host 
organisms deprived of the ability to produce specific 
proteolytic compounds. For example, WO 90/00192 (Genencor, 
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Inc.) describes filamentous fungal hosts incapable of secreting 
enzymatically active aspartic proteinase. EP 574 347 (Ciba 
Geigy AG) describes Aspergillus hosts defective in a serine 
protease of the subtilisin-type. WO 98/12300 (Novo Nordisk A/S) 
5 describes hosts defective in a metalloprotease and an alkaline 
protease. WO 97/12045 (Genencor, Inc.) describes yeast and 
bacterial host systems, which are rendered protease deficient 
resulting from a disruption of a promoter sequence involved in 
the regulation of a protease gene. 

10 Mattern, I.E., et al . , (1992. Mol Gen Genet 234:332-336) 
describe a mutant strain of Aspergillus niger, which was shown 
to have only 1 to 2% of the extracellular protease activity of 
the parent strain, apparently due to a deficiency of at least 
two proteases, aspergillopepsin A and aspergillopepsin B. It 

15 was suggested that the protease deficient phenotype could 
result from a regulatory mutation affecting the expression of 
the genes coding for both proteases. 

The initiation of eukaryotic transcription at a specific 
promoter or set of promoters requires a eukaryotic 

20 transcriptional activator which is a polypeptide, but which is 
not itelf part of RNA polymerase. Many transcriptional 
activators bind to a specific site on the promoter to form a 
functional promoter necessary for the initiation of 
transcription of the polypeptide encoding sequence. However, a 

25 transcriptional activator may also be incorporated into an 
initiation complex only in the presence of other polypeptides. 
Polypeptides with transcriptional activation activity have been 
described in fungi, and a list of such polypeptides has been 
published (Dhawale, S.S., and Lane, A.C. 1993. Nucleic Acid 

30 Research 21:5537-5546). 

Solution proposed by the invention: 

It is an object of the present invention to provide improved 
methods for increasing production of polypeptides in host cells 
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in which the activity of a transcriptional activator involved 
in the regulation of protease production has been modified. 

Summary of the Invention 

5 A first aspect of the present invention relates to an 
isolated nucleic acid sequence encoding a transcriptional 
activator selected from the group consisting of: 

(a) a nucleic acid sequence having at- least 70% 
identity with the nucleic acid sequence of SEQ ID N0:1 or 

10 SEQ ID NO: 48; 

(b) a nucleic acid sequence encoding a polypeptide 
having an amino acid sequence which has at least 50% 
identity with the amino acid sequence of SEQ ID NO: 2 or 
SEQ ID NO: 49; 

15 (c) a nucleic acid sequence which hybridizes under low 

stringency conditions with (i) the nucleic acid sequence 
of SEQ ID N0:1 or SEQ ID NO: 48, or (ii) its 
complementary strand, wherein the low stringency 
conditions are defined by prehybridization and 

20 hybridization at 42°C in 5x SSPE, 0.3% SDS, 200 microg/ml 

sheared and denatured salmon sperm DNA, and 25% 
formamide, and wash conditions are defined by 50°C for 30 
minutes in 2X SSC, 0.2% SDS; 
(d) an allelic variant of (a), (b) , or (c) ; 

25 (e) a subsequence of (a), (b) , (c) , or (d) , wherein the 

subsequence encodes a polypeptide fragment which has 
transcriptional activation activity; and 
(f ) a subsequence of (a) , (b) (c) , or (d) , wherein the 
subsequence encodes a polypeptide with the amino acid 

30 sequence of SEQ ID NO: 3. 

The nucleic acid sequence shown in SEQ ID NO: 1 is the 
Aspergillus niger prtT gene encoding the transcriptional factor 
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shown in SEQ ID NO: 2 as described further below and in the 
Examples . 

The nucleic acid sequence shown in SEQ ID NO: 48 is the 
Aspergillus oryzae IF04177 prtT gene encoding the 
5 transcriptional factor shown in SEQ ID NO: 49. The A. oryzae 
prtT gene has a coding region starting in position 795 and 
ending at position 2931. The prtT gene has 4 introns in 
positions 1028-1135, 1538-1591, 2018-2066, and 2297-2347, 
respectively. This is described further below and in the 
10 Examples. 

In another aspect, the invention also relates to nucleic 
acid constructs, vectors and host cells comprising the nucleic 
acid sequences, and to the polypeptides encoded by the nucleic 
acid sequences. The invention further relates to host cells 
15 useful for the production of a polypeptide, in which the 
production or function of the transcriptional activator has 
been altered, as well as to methods for producing the 
polypeptide. 

20 Brief Description of the Figures 

Figure 1 shows a restriction map of the plasmid pPAP, the 
construction of which is described in Example 1. 

Figure 2 shows a restriction map of the plasmid 
pAopyrGcosArpl , the construction of which is described in 
25 Example 1. 

Figure 3 shows a restriction map of the plasmid pEESl, the 
constructioh of which is described in Example 1. 

Figure 4 shows a restriction map of the plasmid pDprt, the 
construction of which is described in Example 3. 
30 Figure 5 shows a restriction map of the plasmid pGPprt, the 
construction of which is described in Example 4 . 

Figure 6 shows the sequence of the insert in the two plas- 
mids containing the PCR fragment of the A. oryzae prtT Zn 2+ - 
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finger. ICA217 is the sequence from one of the plasmids and 
ICA218 is the sequence from the other. 

Figure 7 shows plasmid pDV8 a pSP65 (Promega™) based 
plasmid containing the HSV-tk gene on a 1.2 kb Bglll/BamHl 
5 fragment inserted between a 1.0 kb Xhol/Bglll fragment of the 
A. nidulans gpd promoter and a 0.8 kb BamHl/Hindlll fragment 
containing the A. nidulans trpC transcriptional terminator. 

Figure 8 shows the construction of pJaL554 described in 
Example 8. 

10 

Detailed Description of the Invention 

Nucleic Acid Sequences Encoding Transcriptional Activators 

A first aspect of the present invention relates to an 
15 isolated nucleic acid sequence encoding a transcriptional 
activator selected from the group consisting of: 

(a) a nucleic acid sequence having at least 70% 

identity with the nucleic acid sequence of SEQ ID NO:l or 
SEQ ID NO: 48; 

20 (b) a nucleic acid sequence encoding a polypeptide 

having an amino acid sequence which has at least 50% 
identity with the amino acid sequence of SEQ ID NO: 2 or 
SEQ ID NO: 49; 

(c) a nucleic acid sequence which hybridizes under low 
25 stringency conditions with (i) the nucleic acid sequence 

of SEQ ID NO:l or SEQ ID NO: 48, or (ii) its 
complementary strand, wherein the low stringency 
conditions are defined by prehybridization and 
hybridization at 42°C in 5x SSPE, 0.3% SDS, 200 micro 
30 g/ml sheared and denatured salmon sperm DNA, and 25% 

formamide, and wash conditions are defined by 50°C for 30 
minutes in 2X SSC, 0.2% SDS; 

(d) an allelic variant of (a) , (b) , or (c) ; 
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(e) a subsequence of (a), (b) , (c) , or (d) , wherein the 
subsequence encodes a polypeptide fragment which has 
transcriptional activation activity; and 

(f ) a subsequence of (a) , (b) , (c) , or (d) , wherein the 
5 subsequence encodes a polypeptide with the amino acid 

sequence of SEQ ID NO:3. 
The term "transcriptional activator'' as used herein refers 
to a polypeptide which has the capability to activate a 
specific promoter or set of promoters necessary for the 

10 initiation of transcription of the polypeptide encoding 
sequence to which it is linked. 

The term "isolated nucleic acid sequence" as used herein 
refers to a nucleic acid sequence which is essentially free of 
other nucleic acid sequences, e.g., at least about 20% pure, 

15 preferably at least about 40% pure, more preferably at least 
about 60% pure, even more preferably at least about 80% pure,, 
most preferably at least about 90% pure as determined by 
agarose electrophoresis. For example, an isolated nucleic acid 
sequence can be obtained by standard cloning procedures used in 

20 genetic engineering to relocate the nucleic acid sequence from 
its natural location to a different site where it will be 
reproduced. The cloning procedures may involve excision and 
isolation of a desired nucleic acid fragment comprising the 
nucleic acid sequence encoding the polypeptide, insertion of 

25 the fragment into a vector molecule, and incorporation of the 
recombinant vector into a host cell where multiple copies or 
clones of the nucleic acid sequence will be replicated. The 
nucleic acid sequence may be of genomic, cDNA, RNA, 
semisynthetic, synthetic origin, or any combinations thereof. 

30 In a preferred embodiment, the nucleic acid sequence has a 
degree of identity to the nucleic acid sequence set forth in 
SEQ ID N0:1 or SEQ ID NO: 48 of at least about 70%, preferably 
at least about 80%, more preferably at least about 85%, even 
more preferably at least about 90%, most preferably at least 
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about 95%, even most preferably at least about 97%, and even 
more preferred at least 99% identity, which encodes an active 
polypeptide. For purposes of the present invention, the degree 
of identity between two nucleic acid sequences is determined by 
5 the Clustal method (Higgins, 198 9, CABIOS 5:151-153) with an 
identity table, a gap penalty of 10, and a gap length penalty 
of 10. 

In an even more preferred embodiment, the nucleic acid 
sequence encoding a transcriptional activator has a nucleic 

10 acid sequence as set forth in SEQ ID NO:l or SEQ ID NO: 48. 

Modification of a nucleic acid sequence encoding a polypep- 
tide of the present invention may be necessary for the 
synthesis of polypeptides substantially similar to the polypep- 
tide. The term "substantially similar" to the polypeptide 

15 refers to non-naturally occurring forms of the polypeptide. 
These polypeptides may differ in some engineered way from the 
polypeptide isolated from its native source. For example, it 
may be of interest to synthesize variants of the polypeptide 
where the variants differ in specific activity, binding 

20 specificity and/or affinity, or the like using, e.g., site- 
directed mutagenesis. The analogous sequence may be con- 
structed on the basis of the nucleic acid sequence presented as 
the polypeptide encoding part of SEQ ID NO: 1 or SEQ ID NO: 48, 
e.g., a subsequence thereof, and/or by introduction of 

25 nucleotide substitutions which do not give rise to another 
amino acid sequence of the polypeptide encoded by the nucleic 
acid sequence, but which corresponds to the codon usage of the 
host organism intended for production of the enzyme, or by 
introduction of nucleotide substitutions which may give rise to 

30 a different amino acid sequence. For a general description of 
nucleotide substitution, see, e.g., Ford et al., 1991, Protein 
Expression and Purification 2: 95-107. 

* 

In another preferred embodiment, the present invention • 
relates to isolated nucleic acid sequences encoding 
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polypeptides having an amino acid sequence which has a degree 
of identity to the amino acid sequence set forth in SEQ ID NO: 2 
or SEQ ID NO: 49 of at least about 50%, preferably at least 
about 60%, preferably at least about 70%, more preferably at 
5 least about 80%, even more preferably at least about 90%, most 
preferably at least about 95%, even most preferably at least 
about 97%, and even more preferred at least 99%, which 
qualitatively retain the transcriptional activation activity of 
the polypeptides (hereinafter "homologous polypeptides"). 

10 In a preferred embodiment, the homologous polypeptides have 
an amino acid sequence which differs by five amino acids, 
preferably by four amino acids, more preferably by three amino 
acids , even more preferably by two amino acids, and most 
preferably by one amino acid from the amino acid sequence set 

15 forth in SEQ ID NO: 2 or SEQ ID NO: 49. For purposes of the 
present invention, the degree of identity between two amino 
acid sequences is determined by the Clustal method (Higgins, 
1989, supra) with an identity table, a gap penalty of 10, and a 
gap length penalty of 10. 

20 Hybridization indicates that by methods of standard Southern 
blotting procedures, the nucleic acid sequence hybridizes to an 
oligonucleotide probe corresponding to the polypeptide encoding 
part of the nucleic acid sequence shown in SEQ ID NO:l , under 
low to high stringency conditions (i.e., prehybridization and 

25 hybridization at 42°C in 5x SSPE, 0.3% SDS, 200 jig/ml sheared 
and denatured salmon sperm DNA, and either 25, 35 or 50% 
formamide for low, medium and high stringencies, respectively) . 
In order to identify a clone or DNA which is homologous with 
SEQ ID NO:l or SEQ ID NO: 48, the hybridization reaction is 

30 washed three times for 30 minutes each using 2X SSC, 0.2% SDS 
preferably at least 50°C, more preferably at least 55°C, more 
preferably at least 60°C, more preferably at least 65°C, even 
more preferably at least 70 °C, and most preferably at least 
75°C. 
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The nucleic acid sequence of SEQ ID NO:l or SEQ ID NO: 48, 
or a subsequence thereof, as well as the amino acid sequence of 
SEQ ID NO: 2 or SEQ ID NO : 49, or a partial sequence thereof, 
or the amino acid sequence of SEQ ID NO: 3, may be used to 
5 design an oligonucleotide probe to identify and isolate or 
clone a homologous gene of any genus or species according to 
methods well known in the art. 

In particular, such probes can be used for hybridization 
with the genomic or cDNA of the genus or species of interest, 

10 following standard Southern blotting procedures, in order to 
identify and isolate the corresponding gene therein. Such 
probes can be considerably shorter than the entire sequence, 
but should be at least 15, preferably at least 25, and more 
preferably at least 40 nucleotides in length. Longer probes 

15 can also be used. Both DNA and RNA probes can be used. The 
probes are typically labeled for detecting the corresponding 
gene (for example, with 32 P, 3 H, 35 S, biotin, or avidin) . For 
example, molecules to which a 32 P-, 3 H- or 35 S-labelled 
oligonucleotide probe hybridizes may be detected by use of X- 

20 ray film. 

Thus, a genomic, cDNA or combinatorial chemical library- 
prepared from such other organisms may be screened for DNA 
which hybridizes with the probes described above and which 
encodes a polypeptide with transcriptional activation activity. 

25 Genomic or other DNA from such other organisms may be separated 
by agarose or polyacrylamide gel electrophoresis, or other 
separation techniques. DNA from the libraries or the separated 
DNA may be transferred to and immobilized on nitrocellulose or 
other suitable carrier material. A clone or DNA which is 

30 homologous to SEQ ID NO:l or SEQ ID NO: 48 may then be 
identified following standard Southern blotting procedures. 

An allelic variant denotes any of two or more alternative 
forms of a gene occupying the same chomosomal locus. Allelic 
variation arises naturally through mutation, and may result in 
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phenotypic polymorphism within populations. Gene mutations can 
be silent (i.e., no change in the encoded polypeptide) or may 
encode polypeptides having altered amino acid sequences. 

The term "allelic variant of a polypeptide' 7 is a polypeptide 
5 encoded by an allelic variant of a gene. In a preferred 
embodiment, the nucleic acid sequence encoding a 
transcriptional activator of the present invention is an 
allelic variant of a nucleic acid sequence selected from the 
group consisting of nucleic acid sequences: (a) having at least 

10 70% identity with the nucleic acid sequence of SEQ ID NO:l or 
SEQ ID NO: 48 , (b) encoding a polypeptide having an amino acid 
sequence which has at least 50% identity with the amino acid 
sequence of SEQ ID NO: 2 or SEQ ID NO: 49, (c) which hybridizes 
under low stringency conditions with the nucleic acid sequence 

15 of SEQ ID NO:l or SEQ ID NO: 48, or its complementary strand, 
and (d) encoding a polypeptide having the amino acid sequence 
of SEQ ID NO:3. 

The present invention also encompasses nucleic acid 
sequences which differ from SEQ ID NO:l or SEQ ID NO: 48 by 

20 virtue of the degeneracy of the genetic code. The present 
invention also relates to subsequences of SEQ ID NO:l or SEQ ID 
NO: 49, wherein a subsequence of SEQ. ID NO: 1 is a nucleic acid 
sequence encompassed by SEQ ID N0:1 or SEQ ID NO: 48 except 
that one or more nucleotides from the 5 1 and/or 3 1 end have 

25 been deleted. Preferably, a subsequence of SEQ ID NO:l or SEQ 
ID NO: 48 encodes a polypeptide fragment which has 
transcriptional activation activity. In a more preferred 
embodiment, a subsequence of SEQ ID NO:l or SEQ ID NO: 48 
contains at least a nucleic acid sequence encoding the 

30 polypeptide sequence shown in SEQ ID NO: 3. 

The techniques used to isolate or clone a nucleic acid 
sequence encoding a polypeptide are known in the art and 
include isolation from genomic DNA, preparation from cDNA, or a 
combination thereof. The cloning of the nucleic acid sequences 
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of the present invention from such genomic DNA can be effected, 
e.g., by using methods based on polymerase chain reaction (PCR) 
or antibody screening of expression libraries to detect cloned 
DNA fragments with shared structural features. (See, e.g., 
5 Innis et al . , 1990, PCR: A Guide to Methods and Application, 
Academic Press, New York.) Other nucleic acid amplification 
procedures such as ligase chain reaction (LCR) , ligated 
activated transcription (LAT) and nucleic acid sequence-based 
amplification (NASBA) may be used. The nucleic acid sequence 
10 may be cloned from a microorganism, or another or related 
organism and thus, for example, may be an allelic or species 
variant of the polypeptide encoding region of the nucleic acid 
sequence . 

The transcriptional activators encoded by nucleic acid 

15 sequences which hybridize with an oligonucleotide probe which 
hybridizes with the nucleic acid sequence of SEQ ID N0:1 or SEQ 
ID NO: 48, its complementary strand, or allelic variants and 
subsequences of SEQ ID NO:l or SEQ ID NO: 48, or allelic 
variants and fragments of the transcriptional activators may be 

20 obtained from microorganisms of any genus. 

In a preferred embodiment, the transcriptional activators 
may be obtained from a fungal cell. * Fungi" as used herein 
includes the phyla Ascomycota, Basidiomycota, Chytridiomycota, 
and Zygomycota (as defined by Hawksworth, et al., in Ainsworth 

25 and Bisby's Dictionary of The Fungi, 8th edition, 1995, CAB 
International, University Press, Cambridge, UK) as well as the 
Oomycota (as cited in Hawksworth et al., 1995, supra, page 171) 
and all mitosporic fungi (Hawksworth ef al., 1995, supra). 

In preferred embodiment, the fungal source is a filamentous 

30 fungal strain. u Filamentous fungi" include all filamentous 
forms of the subdivision Eumycota and Oomycota (as defined by 
Hawksworth et al . , 1995, supra). The filamentous fungi are 
characterized by a mycelia wall composed of chitin, cellulose, 
glucan, chitosan, mannan, and other complex polysaccharides. 
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Vegetative growth is by hyphal elongation and carbon catabolism 
is obligately aerobic. Filamentous fungal strains include, but 
are not limited to, strains of Acremonium, Aspergillus, 
Aureobasidium, Cryptococcus , Filibasidium, Fusarium, Humicola, 
5 Magnaporthe, Mucdr, Myceliophthora, Neocallimastix, Neurospora, 
Paecilomyces , Penicillium, Piromyces, Schizophyllum, 

Talaromyces , Thermoascus, Thielavia, Tolypocladium, and 
Tri choderma . 

In a more preferred embodiment, the nucleic acid sequence 
10 encoding a transcriptional activator of the present invention 

is obtained from a strain of Aspergillus, such as A. awamori or 

A. nidulans. Preferably, the nucleic acid sequence is obtained 

from a strain of A. niger or A. oryzae. Even more preferably, 

the nucleic acid sequence is obtained from an isolate of a 
15 strain of A. niger, DSM 12298/ e.g., the nucleic acid sequence 

set forth in SEQ ID N0:1 , or from A. oryzae IFO 4177, i.e., 

the nucleic acid sequence set forth in SEQ ID NO: 48. 

In another more preferred embodiment, the nucleic acid 

sequence encoding a transcriptional activator of the present 
20 invention is obtained from a strain of Fusarium, such as F. 

oxysporum. Preferably, the strain is a strain of F. venenatum 

(Nirenberg sp. nov.) . 

In another preferred embodiment, the nucleic acid sequence 

encoding a transcriptional activator of the present invention 
25 is obtained from a yeast strain, such as a Candida, 

Kluyveromyces, Schizosaccharomyces , or Yarrowia strain. 

Preferably, the strain is a strain of Hansenula, Pichia, or 

Saccharomyces . 

It will be understood that for the aforementioned species, 
30 the invention encompasses both the perfect and imperfect 
states, and other' taxonomic equivalents, e.g., anamorphs, 
regardless of the species name by which they are known. Those 
skilled in the art will readily recognize the identity of 
appropriate equivalents. For example, the polypeptides may be 
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obtained from microorganisms, which are taxonomic equivalents 
of Aspergillus as defined by Raper, K.D. and Fennel, D.I. 
(1965. The Genus Aspergillus, The Wilkins Company, Baltimore 
MD) . regardless of the species name by which they are known. 
5 Aspergilli are mitosporic fungi characterized by an aspergillum 
comprised of a conidiospore stipe with no known teleomorphic 
states terminating in a vesicle, which in turn bears one or two 
layers of synchronously formed specialized cells, variously 
referred to as sterigmata or phialides, and asexually formed 

io spores referred to as conidia. Known teleomorphs of 

Aspergillus include Eurotium, Neosartorya, and Emericella. 
Strains of Aspergillus and teleomorphs thereof are readily 
accessible to the public in a number of culture collections, 
such as the American Type Culture Collection (ATCC) , Deutsche 

is Sammlung von Mikroorganismen und Zellkulturen GmbH (DSM) , 
Centraalbureau Voor Schimmelcultures (CBS) , and Agricultural 
Research Service Patent Culture Collection, Northern Regional 
Research Center (NRRL) . 

Furthermore, such transcriptional activators may be 

20 identified and obtained from other sources including 
microorganisms isolated from nature (e.g., soil, composts, 
water, etc.) using the above-mentioned probes. Techniques for 
isolating microorganisms from natural habitats are well known 
in the art. The nucleic acid sequence may then be derived by 

25 similarly screening a genomic or cDNA library of another 
microorganism. Once a nucleic acid sequence encoding a 
transcriptional activator has been detected with the probe (s) , 
the sequence may be isolated or cloned by utilizing techniques 
which are known to those of ordinary skill in the art (see, 

30 e.g., J. Sambrook, E.F. Fritsch, and T. Maniatus, 1989, 
Molecular Cloning, A Laboratory Manual, 2d edition, Cold Spring 
Harbor, New York) . 

In another preferred embodiment, the- isolated nucleic acid 
sequence encodes a polypeptide comprising the amino acid 
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sequence of SEQ ID NO: 2 or SEQ ID NO: 49, or a fragment 
thereof, which has transcriptional activation activity. 

In another preferred embodiment, the isolated nucleic acid 
sequence encodes a polypeptide comprising the amino acid 
5 sequence of SEQ ID NO: 3. 

The present invention also relates to isolated nucleic acid 
sequences encoding a transcriptional activator of the present 
invention, which, e.g., using methods of standard Southern 
blotting procedures described above (cf., Sambrook, et al., 

10 1989, supra), hybridize under low stringency conditions, more 
preferably medium stringency conditions, and most preferably 
high stringency conditions, with an oligonucleotide probe which 
hybridizes under the same conditions with the nucleic acid 
sequence set forth in SEQ ID NO:l or SEQ ID NO: 48 or its 

15 complementary strand, or allelic variants and subsequences of 
SEQ ID NO:l or SEQ ID NO: 4 8 which encode polypeptide 
fragments which are transcriptional activators in fungi. 

In another more preferred embodiment, the nucleic acid 
sequence is the nucleic acid sequence encoding a polypeptide, 

20 which has DNA binding activity contained in the plasmid pEES 
which is contained in Escherichia coli DSM 12294. 

Nucleic Acid Constructs 

Another aspect of the present invention relates to nucleic 
25 acid constructs comprising a nucleic acid sequence encoding a 
transcriptional activator of the present invention operably 
linked to one or more control sequences, which direct the 
production of the transcriptional activator in a suitable 
expression host. In a preferred embodiment, the nucleic acid 
30 sequence encodes a polypeptide, which is contained in the 
plasmid pEES harboured in Escherichia coli DSM 12294, or the 
nucleic acid sequence shown in SEQ ID NO: 48 encoding the 
polypeptide shown in SEQ ID NO: 49. 
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Expression will be understood to include any step involved 
in the production of the polypeptide including, but not limited 
to, transcription, post- transcriptional modification, 
translation, post-translational modification, and secretion. 
5 Manipulation of the nucleic acid sequence encoding a 
polypeptide prior to its insertion into a vector may be 
desirable or necessary depending on the expression vector. The 
techniques for modifying nucleic acid sequences utilizing 
cloning methods are well known in the art. 

10 "Nucleic acid construct" is defined herein as a nucleic acid 
molecule, either single- or double -stranded, which is isolated 
from a naturally occurring gene or which has been modified to 
contain segments of nucleic acid which are combined and 
juxtaposed in a manner which would not otherwise exist in 

15 nature. The term nucleic acid construct is synonymous with the 
term expression cassette when the nucleic acid construct 
contains all the control sequences required for expression of a 
coding sequence. The term ''coding sequence'' as defined herein 
is a sequence, which is transcribed into mRNA and translated 

20 into a transcriptional activator of the present invention. The 
boundaries of the coding sequence are generally determined by 
the ATG start codon at the 5' end of the mRNA and a 
transcription terminator sequence located just downstream of 
the open reading frame at the 3' end of the mRNA. A coding 

25 sequence can include, but is not limited to, DNA, cDNA, and 
recombinant nucleic acid sequences. 

The term "control sequences" is defined herein to include 
all components, which are necessary or advantageous for the 
expression of a polypeptide. Each control sequence may be 

30 native or foreign to the nucleic acid sequence encoding the 
polypeptide. Such control sequences include, but are not 
limited to, a leader, a polyadenylation sequence, a propeptide 
sequence, a promoter, a signal sequence, and a transcription 
terminator. At a minimum, the control sequences include a 
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promoter, and transcriptional and translational stop signals. 
The control sequences may be provided with linkers for the 
purpose of introducing specific restriction sites facilitating 
ligation of the control sequences with the coding region of the 
5 nucleic acid sequence encoding a polypeptide. The term 
"operably linked" is defined herein as a configuration in which 
a control sequence is appropriately placed at a position 
relative to the coding sequence of the DNA sequence such that 
the control sequence directs the production of a polypeptide. 

10 The control sequence may be an appropriate promoter 
sequence, a nucleic acid sequence, which is recognized by a 
host cell for expression of the nucleic acid sequence. The 
promoter sequence contains transcriptional control sequences, 
which mediate the expression of the polypeptide. The promoter 

15 may be any nucleic acid sequence, which shows transcriptional 
activity in the cell including mutant, truncated, and hybrid 
promoters, and may be obtained from genes encoding 
extracellular or intracellular polypeptides either homologous 
or heterologous to the cell. 

20 The control sequence may also be a suitable transcription 
terminator sequence, a sequence recognized by a filamentous 
fungal cell to terminate transcription. The terminator 
sequence is operably linked to the 3' terminus of the nucleic 
acid sequence encoding the polypeptide. Any terminator, which 

25 is functional in the cell, may be used in the present 
invention. 

Preferred terminators for filamentous fungal cells are 
obtained from the genes encoding Aspergillus oryzae TAKA 
amylase, Aspergillus niger glucoamylase, Aspergillus nidulans 
30 anthranilate synthase, Aspergillus niger alpha-glucosidase, and 
Fusarium oxysporum trypsin-like protease. 

The control sequence may also be a suitable leader sequence, 
a nontranslated region of a mRNA which is important for 
translation by the filamentous fungal cell. The leader 
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sequence is operably linked to the 5' terminus of the nucleic 
acid sequence encoding the polypeptide. Any leader sequence, 
which is functional in the cell, may be used in the present 
invention. 

5 Preferred leaders for filamentous fungal cells are obtained 
from the genes encoding Aspergillus oryzae TAKA amylase and 
Aspergillus nidulans triose phosphate isomerase. 

The control sequence may also be a polyadenylation sequence, 
a sequence which is operably linked to the 3' terminus of the 

10 nucleic acid sequence and which, when transcribed, is 
recognized by the filamentous fungal cell as a signal to add 
polyadenosine residues to transcribed mRNA. Any 
polyadenylation sequence, which is functional in the cell, may 
be used in the present invention. 

15 Preferred polyadenylation sequences for filamentous fungal 
cells are obtained from the genes encoding Aspergillus oryzae 
TAKA amylase, Aspergillus niger glucoamylase, Aspergillus 
nidulans anthranilate synthase, and Aspergillus niger alpha - 
glucosidase . 

20 The control sequence may also be a signal peptide-coding 
region, which codes for an amino acid sequence linked to the 
amino terminus of the polypeptide, which can direct the encoded 
polypeptide into the cell's secretory pathway. The 5' end of 
the coding sequence of the nucleic acid sequence may inherently 

25 contain a signal peptide-coding region naturally linked in 
translation reading frame with the segment of the coding 
region, which encodes the secreted polypeptide. Alternatively, 
the 5' end of the coding sequence may contain a signal peptide- 
coding region, which is foreign to the coding sequence. The 

30 foreign signal peptide-coding region may be required where the 
coding sequence does not normally contain a signal peptide- 
coding region. Alternatively, the foreign signal peptide- 
coding region may simply replace the natural signal peptide- 
coding region in order to obtain enhanced secretion of the 
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polypeptide. The signal peptide -coding region may be obtained 
from a glucoamylase or an amylase gene from an Aspergillus 
species, or a lipase or proteinase gene from a Rhizomucor 
species. However, any signal peptide -coding region, which 
5 directs the expressed polypeptide into the secretory pathway of 
a filamentous fungal cell, may be used in the present 
invention. 

An effective signal peptide coding region for filamentous 

fungal cells is the signal peptide coding region obtained from 
10 the Aspergillus oryzae TAKA amylase gene, Aspergillus niger 

neutral amylase gene, Rhizomucor miehei aspartic proteinase 

gene, or Humicola lanuginosa cellulase gene. 

The control sequence may also be a propeptide coding region, 

which codes for an amino acid sequence positioned at the amino 
15 terminus of a polypeptide. The resultant polypeptide is known 

as a proenzyme or propolypeptide (or a zymogen in some cases) . 

A propolypeptide is generally inactive and can be converted to 

a mature active polypeptide by catalytic or autocatalytic 

cleavage of the propeptide from the propolypeptide. The 
20 propeptide coding region may be obtained from the Rhizomucor 

miehei aspartic proteinase gene, or the Myceliophthora 

thermophila laccase gene (WO 95/33 836) . 

Where both signal peptide and propeptide regions are present 

at the amino terminus of a polypeptide, the propeptide region 
25 is positioned next to the amino terminus of a polypeptide and 

the signal peptide region is positioned next to the amino 

terminus of the propeptide region. 

Expression Vectors 

30 The present invention also relates to recombinant expression 
vectors comprising a nucleic acid sequence of the present 
invention, a promoter, and transcriptional and translational 
stop signals. The various nucleic acid and control sequences 
described above may be joined together to produce a recombinant 



WO 01/68864 



PCT/DK01/00169 



19 

expression vector which may include one or more convenient 
restriction sites to allow for insertion or substitution of the 
nucleic acid sequence encoding the polypeptide at such sites. 
Alternatively, the nucleic acid sequence encoding the 
5 polypeptide may be expressed by inserting the sequence or a 
nucleic acid construct comprising the sequence into an 
appropriate vector for expression. In creating the expression 
vector, the coding sequence is located in the vector so that 
the coding sequence is operably linked with the appropriate 

10 control sequences for expression, and possibly secretion. 

The recombinant expression vector may be any vector (e.g., a 
plasmid or virus) , which can be conveniently subjected to 
recombinant DNA procedures and can bring about the expression 
of the nucleic acid sequence encoding the polypeptide. The 

15 choice of the vector will typically depend on the compatibility 
of the vector with the filamentous fungal cell into which the 
vector is to be introduced. The vectors may be linear or 
closed circular plasmids. The vector may be an autonomously 
replicating vector, i.e., a vector, which exists as an 

20 extrachromosomal entity, the replication of which is 
independent of chromosomal replication, e.g., a plasmid, an 
extrachromosomal element, a minichromosome, or an artificial 
chromosome. The vector may contain any means for assuring 
self -replication. Alternatively, the vector may be one which, 

25 when introduced into the filamentous fungal cell, is integrated 
into the genome and replicated together with the chromosome (s) 
into which it has been integrated. The vector system may be a 
single vector or plasmid or two or more vectors or plasmids, 
which together contain the total DNA to be introduced into the 

30 genome of the filamentous fungal cell, or a transposon. 

The vectors preferably contain one or more selectable 
markers, which permit easy selection of transformed cells. A 
selectable marker is a gene the product of which provides for 
biocide or viral resistance, resistance to heavy metals, 
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prototrophy to auxotrophs, and the like. A selectable marker 
for use in a filamentous fungal cell may be selected from the 
group including, but not limited to, amdS (acetamidase) , argrS 
(ornithine carbamoyl transferase) , bar (phosphinothricin 
5 acetyl transferase) , hygB (hygromycin phosphotransferase) , niaD 
(nitrate reductase), pyrG (orotidine-5 ' -phosphate 

decarboxylase), sC (sulfate' adenyl transferase) , and trpC 
(anthranilate synthase) , as well as equivalents from other 
species. Preferred for use in an Aspergillus cell are the amdS 

10 and pyrG genes of Aspergillus nidulans or Aspergillus oryzae 
and the bar gene of Streptomyces hygroscopicus. 

The vectors preferably contain an element (s) that permits 
stable integration of the vector into the host cell genome or 
autonomous replication of the vector in the cell independent of 

15 the genome of the cell . 

Host Cells 

Another aspect of the present invention relates to host 
cells comprising a nucleic acid construct or an expression 

20 vector of the present invention. 

The choice of a host cell in the methods of the present 
invention will to a large extent depend upon the source of the 
nucleic acid sequence encoding the polypeptide of interest. 

The introduction of an expression vector or a nucleic acid 

25 construct into a filamentous fungal cell may involve a process 
consisting of protoplast formation, transformation of the 
protoplasts, and regeneration of the cell wall in a manner 
known per se. Suitable procedures for transformation of 
Aspergillus cells are described in EP 238 023 and Yelton et 

30 al., 1984, Proceedings of the National Academy of Sciences USA 
81: 1470-1474. A suitable method of transforming Fusarium 
species is described by Malardier et al., 1989, Gene 78: 147- 
156 or in WO 96/00787. 
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"Introduction" means introducing a vector comprising the 
nucleic acid sequence encoding the polypeptide into a 
filamentous fungal cell so that the vector is maintained as a 
chromosomal integrant or as a self -replicating extra- 
5 chromosomal vector. Integration is generally considered to be 
an advantage as the nucleic acid sequence is more likely to be 
stably maintained in the cell. Integration of the vector into 
the chromosome occurs by homologous recombination, non- 
homologous recombination, or transposition. 

10 For integration into the host cell genome, the vector may 
rely on the nucleic acid sequence encoding the polypeptide or 
any other element of the vector for stable integration of the 
vector into the genome by homologous or nonhomologous 
recombination. Alternatively, the vector may contain 

15 additional nucleic acid sequences for directing integration by 
homologous recombination into the genome of the host cell. -The 
additional nucleic acid sequences enable the vector to be 
integrated into the host cell genome at a precise location (s) 
in the chromosome (s) . To increase the likelihood of 

20 integration at a precise location, the integrational elements 
should preferably contain a sufficient number of nucleic acids, 
such as 100 to 1,500 base pairs, preferably 400 to 1,500 base 
pairs, and most preferably 800 to 1,500 base pairs, which are 
highly homologous with the corresponding target sequence to 

25 enhance the probability of homologous recombination. The 
integrational elements may be any sequence that is homologous 
with the target sequence in the genome of the host cell. These 
nucleic acid sequences may be any sequence that is homologous 
with a target sequence in the genome of the host cell, and, 

30 furthermore, may be non-encoding or encoding sequences. 

For autonomous replication, the vector may further comprise 
an origin of replication enabling the vector to replicate 
autonomously in the host cell. 
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The procedures used, to ligate the elements described above 
to construct the recombinant expression vectors are well known 
to one skilled in the art (see, e.g., Sambrook, et al . , supra). 
In a preferred embodiment, the filamentous fungal host cell 
5 is a cell of a species of, but not limited to, Acremonium, 
Aspergillus, Fusarium, Humicola, Mucor, Myceliophthora, 
Neurospora/ Penicillium, Thielavia, Tolypocladium , or 
Trichoderma. 

In a more preferred embodiment, the filamentous fungal cell 

10 is an Aspergillus cell. In another more preferred embodiment, 
the filamentous fungal cell is an Acremonium cell. In another 
more preferred embodiment, the filamentous fungal cell is a 
Fusarium cell. In another more preferred embodiment, the 
filamentous fungal cell is a Humicola cell. In another more 

15 preferred embodiment, the filamentous fungal cell is a Mucor 
cell. In another more preferred embodiment, the filamentous 
fungal cell is a Myceliophthora cell. In another more 
preferred embodiment, the filamentous fungal cell is a 
Neurospora cell. In another more preferred embodiment, the 

20 filamentous fungal cell is a Penicillium cell. In another more 
preferred embodiment, the filamentous fungal cell is a 
Thielavia cell. In another more preferred embodiment, the 
filamentous fungal cell is a Tolypocladium cell. In another 
more preferred embodiment/ the filamentous fungal cell is a 

25 Trichoderma cell. 

In a most preferred embodiment, the filamentous fungal cell 
is an Aspergillus awamori, Aspergillus foetidus, Aspergillus 
japonicus, Aspergillus nidulans, Aspergillus niger or 
Aspergillus oryzae cell. In another most preferred embodiment, 

30 the filamentous fungal cell is a Fusarium bactridioides, 
Fusarium cerealis, Fusarium crookwellense, Fusarium culmorum, 
Fusarium graminearum, Fusarium graminum, Fusarium heterosporum, 
Fusarium negundi, Fusarium oxysporum, Fusarium reticulatum, 
Fusarium roseum, Fusarium sambucinum, Fusarium sarcochroum, 
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Fusarium sulphureum, Fusarium toruloseum, Fusarium 
trichothecioides, or Fusarium venenatum cell. In an even most 
preferred embodiment, the filamentous fungal cell is a Fusarium 
venenatum (Nirenberg sp. nov.). In another most preferred 
5 embodiment, the filamentous fungal cell is a Humicola insolens 
or Humicola lanuginosa cell. In another most preferred 
embodiment, the filamentous fungal cell is a Mucor miehei cell. 
In another most preferred embodiment, the filamentous fungal 
cell is a Myceliophthora thermophilum cell. In another most 

10 preferred embodiment, the filamentous fungal cell is a 
Neurospora crassa cell. In another most preferred embodiment, 
the filamentous fungal cell is a Penicillium purpurogenum cell. 
In another most preferred embodiment, the filamentous fungal 
cell is a Thielavia terrestris cell. In another most preferred 

15 embodiment, the Trichoderma cell is a Trichoderma harzianum, 
Trichoderma koningii, Trichoderma longibrachiatum, Trichoderma 
reesei or Trichoderma viride cell. 

Polypeptides having Transcriptional Activation Activity 

20 Another aspect of the present invention relates to an 
isolated polypeptide selected from the group consisting of: 

(a) a polypeptide which is encoded in a nucleic acid 

sequence which hybridizes under low stringency conditions 
with (i) the nucleic acid sequence of SEQ ID N0:1 or SEQ 

25 ID NO: 48; (ii) its complementary strand, or (iii) a 

subsequence of SEQ ID NO:l or SEQ ID NO: 48 which 
encodes a polypeptide fragment which has transcriptional 
activation activity, wherein the low stringency 
conditions are defined by prehybridization and 

30 hybridization at 42°C in 5X SSPE, 0.3% SDS, 200 micro 

g/ml sheared and denatured salmon sperm DNA, and 25% 
formamide, and wash conditions are defined at 50°C for 30 
minutes in 2X SSC, 0.2% SDS; 
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(b) a polypeptide having an amino acid sequence which 
has at least 50% identity with the amino acid sequence of 
SEQ ID NO: 2 or SEQ ID NO: 49; 

(c) an allelic variant of (a) or (b) ; 

5 (d) a fragment of (a) , (b) , or (c) , wherein the 

fragment has transcriptional activation activity; and 1 
(e) a polypeptide comprising the amino acid sequence of 

SEQ ID NO: 3, or an allelic variant thereof. 
The transcriptional activator may be isolated using 
10 techniques as described herein. As defined herein, an 
"isolated" polypeptide is a polypeptide which is essentially 
free of other polypeptides, e.g., at least about 20% pure, 
preferably at least about 40% pure, more preferably about 60% 
pure, even more preferably about 80% pure, most preferably 
15 about 90% pure, and even most preferably about 95% pure, as 
determined by SDS-PAGE. 

The present invention also relates to isolated polypeptides 
having an amino acid sequence which has a degree of identity to 
the amino acid sequence of SEQ ID NO: 2 or SEQ ID NO: 49 of at 
20 least about 50%, preferably at least about 55%, preferably at 
least about 60%, preferably at least about 65%, preferably at 
least about 70%, preferably at least about 75%, preferably at 
least about 80%, more preferably at least about 85%, even more 
preferably at least about 90%, most preferably at least about 
25 95%, and even most preferably at least about 97%, even more 
preferred at least 99%, which have transcriptional activation 
activity. 

In more preferred embodiment, the transcriptional activator 
of the present invention comprises the amino acid sequence of 
30 SEQ ID NO: 2 or SEQ ID NO: 49 or a fragment thereof, wherein the 
fragment retains transcriptional activation activity. In a 
most preferred embodiment, the polypeptide has the amino acid 
sequence of SEQ ID NO: 2 or SEQ ID NO: 49. A fragment of SEQ 
ID NO: 2 or SEQID NO: 49 is a polypeptide having one or more 
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amino acids deleted from the amino and/or carboxy terminus of 
this amino acid sequence. Preferably, a fragment of SEQ ID 
NO: 2 or SEQ ID NO: 4 9 contains at least the polypeptide 
sequence shown in SEQ ID NO:3. 
5 The amino acid sequences of the homologous polypeptides may 
differ from the amino acid sequence of SEQ ID NO: 2 or SEQ ID 
NO: 49 or SEQ ID NO: 3 by an insertion or deletion of one or 
more amino acid residues and/or the substitution of one or more 
amino acid residues by different amino acid residues. 

10 Preferably, amino acid changes are of a minor nature, that is 
conservative amino acid substitutions that do not significantly 
affect the folding and/or activity of the protein; small 
deletions, typically of one to about 30 amino acids; small 
amino- or carboxyl- terminal extensions, such as an amino- 

15 terminal methionine residue; a small linker peptide of up to 
about 20-25 residues; or a small extension that facilitates 
purification by changing net charge or another function, such 
as a poly-histidine tract, an antigenic epitope or a binding 
domain . 

20 Examples of conservative substitutions are within the group 
of basic amino acids (such as arginine, lysine and histidine) , 
acidic amino acids (such as glutamic acid and. aspartic acid) , 
polar amino acids (such as glutamine and asparagine) , 
hydrophobic amino acids (such as leucine, isoleucine and 

25 valine) , aromatic amino • acids (such as phenylalanine, 
tryptophan and tyrosine) , and small amino acids (such as 
glycine, alanine, serine, threonine and methionine) . Amino 
acid substitutions, which do not generally alter the specific 
activity are known in the art and are described, for example, 

30 by H. Neurath and R.L. Hill (1979. The Proteins, Academic 
Press, New York) . The most commonly occurring exchanges are 
Ala/Ser, Val/Ile, Asp/Glu, Thr/Ser, Ala/Gly, Ala/Thr, Ser/Asn, 
Ala/Val, Ser/Gly, Tyr/Phe, Ala/Pro, Lys/Arg, Asp/Asn, Leu/Ile, 
Leu/Val, Ala/Glu, and Asp/Gly as well as these in reverse. 
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In a more preferred embodiment, a transcriptional activator 
of the present invention is obtained from an Aspergillus niger 
strain, more preferably from Aspergillus niger AB4 . 1 (van 
Haringsveldt , W. , et al . , 1987. Mol . Gen. Genet. 206:71-75), 
5 and most preferably from Aspergillus niger 13PAP2, which has 
been deposited at DSM as DSM 122 98, or a mutant strain thereof, 
harbouring, e.g., the polypeptide with the amino acid sequence 
of SEQ ID NO: 2 or SEQ ID NO: 49 or SEQ ID NO: 3. 

In another preferred embodiment, the transcriptional 

10 activator of the present invention is the polypeptide encoded 
in the nucleic acid sequence contained in plasmid pEES, which 
is contained in Escherichia coli DSM 12294 or the nucleic acid 
sequence shown in SEQ ID NO: 48 encoding the polypeptide shown 
in SEQ ID NO: 49. 

15 The present invention further relates to methods for 
producing the transcriptional activator of the present 
invention comprising (a) cultivating a host cell harbouring a 
nucleic acid construct or an expression vector comprising a 
nucleic acid sequence encoding the transcriptional activator of 

20 the invention under conditions conducive for production of the 
polypeptide; and (b) recovering the polypeptide. 

Host Cells having Altered Transcriptional Activation Activity 

Another aspect of the present invention relates to a host 
25 cell which is a mutant of a parent fungal cell useful for the 
production of a polypeptide in which the parent cell comprises 
one or more nucleic acid sequences encoding a protease, the 
transcription of which is activated by a transcriptional 
activator of the present invention, and the mutant cell 
30 produces less of the transcriptional activator and the 
protease (s) than the parent cell when cultured under the same 
conditions . 
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The mutant cell may be constructed using methods well known 
in the art; for example, by one or more nucleotide insertions or 
deletions of the gene encoding the transcriptional activator. 

In a preferred embodiment the mutant cell is obtained by 
5 modification or inactivation of a nucleic acid sequence present 
in the cell and necessary for expression of the transcriptional 
activator. 

In a more preferred embodiment, the nucleic acid sequence is 
selected from the group consisting of: (a) a nucleic acid 

10 sequence having at least 70% identity with the nucleic acid 
sequence of SEQ ID N0:1 or SEQ ID NO: 48/ (b) a nucleic acid 
sequence encoding a polypeptide having an amino acid sequence 
which has at least 50% identity with the amino acid sequence of 
SEQ ID NO: 2 or SEQ ID NO: 49; (c) a nucleic acid sequence which 

15 hybridizes under low stringency conditions with (i) the nucleic 
acid sequence of SEQ ID NO:l or SEQ ID NO: 48, or (ii) its 
complementary strand, (d) an allelic variant of (a) , (b) , or 
(c) ; (e) a subsequence of (a) , (b) , (c) , or (d) , wherein the 
subsequence encodes a polypeptide fragment which has 

20 transcriptional activation activity; and (f) a subsequence of 
(a) , (b) (c) , or (d) , wherein the subsequence encodes a 
polypeptide with the amino acid sequence of SEQ ID NO:3. 

In another preferred embodiment the reduced expression of the 
transcriptional activator in the mutant cell is obtained by 

25 modification or inactivation of a control sequence required for 
the expression of the transcriptional activator. The term 
11 control sequence" is defined, supra, in the section entitled 
"Nucleic Acid Constructs." In a more preferred embodiment the 
control sequence in the mutant cell is a promoter sequence or a 

30 functional part thereof, i.e., a part, which is sufficient for 
affecting expression of the nucleic acid sequence. Other 
control sequences for possible modification include, but are 
not limited to, a leader, a polyadenylation sequence, a 
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propeptide sequence, a signal sequence, and a transcription 
terminator. 

Modification or inactivation of the gene may be performed by 
subjecting the parent cell to mutagenesis and selecting for 
5 mutant cells in which the capability to produce a 
transcriptional activator has been reduced. The mutagenesis, 
which may be specific or random, may be performed, for example, 
by use. of a suitable physical or chemical mutagenizing agent, 
by use of a suitable oligonucleotide, or by subjecting the DNA 

10 sequence to PCR generated mutagenesis. Furthermore, the 
mutagenesis may be performed by use of any combination of these 
mutagenizing agents. 

Examples of a physical or chemical mutagenizing agent 
suitable for the present purpose include ultraviolet (UV) 

15 irradiation, hydroxyl amine, N-methyl-N* -nitro-N- 

nitrosoguanidine (MNNG) , O-methyl hydroxyl amine, nitrous acid, 
ethyl methane sulphonate (EMS) , sodium bisulphite, formic acid, 
and nucleotide analogues. 

When such agents are used, the mutagenesis is typically 

20 performed by incubating the parent cell to be mutagenized in 
the presence of the mutagenizing agent of choice under suitable 
conditions, and selecting for mutant cells exhibiting reduced 
expression of the gene. 

Modification or inactivation of the gene may be accomplished 

25 by introduction, substitution, or removal of one or more 
nucleotides in the gene's nucleic acid sequence or a regulatory 
element required for the transcription or translation thereof. 
For example, nucleotides may be inserted or removed so as to 
result in the introduction of a stop codon, the removal of the 

30 start codon, or a change of the open reading frame. Such 
modification or inactivation may be accomplished by site- 
directed mutagenesis or PCR generated mutagenesis in accordance 
with methods known in the art. Although, in principle, the 
modification may be performed in vivo, i.e., directly on the 
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fungal cell expressing the gene to be modified, it is preferred 
that the modification be performed in vitro as exemplified 
below. 

An example of a convenient way to inactivate or reduce 
5 expression of the gene by a fungal cell of choice is based on 
techniques of gene replacement or gene interruption. For 
example, in the gene interruption method, a nucleic acid 
sequence corresponding to the endogenous gene or gene fragment 
of interest is mutagenized in vitro to produce a defective 

10 nucleic acid sequence which is then transformed into the parent 
cell to produce a defective gene. By homologous recombination, 
the defective nucleic acid sequence replaces the endogenous 
gene or gene fragment. It may be desirable that the defective 
gene or gene fragment also encodes a marker, which may be used 

is for selection of transformants in which the nucleic acid 
sequence has been modified or destroyed. 

Alternatively, modification or inactivation of the gene may 
be performed by established anti-sense techniques using a 
nucleotide sequence complementary to the nucleic acid sequence 

20 of the gene. More specifically, expression of the gene by a 
filamentous fungal cell may be reduced or eliminated by 
introducing a nucleotide sequence complementary to the nucleic 
acid sequence, which may be transcribed in the cell and is 
capable of hybridizing to the mRNA produced in the cell. Under 

25 conditions allowing the complementary anti- sense nucleotide 
sequence to hybridize to the mRNA, the amount of protein 
translated is thus reduced or eliminated. 

A nucleic acid sequence complementary to the nucleic acid 
sequence of SEQ ID NO:l or SEQ ID NO: 48 may be obtained from 

30 any microbial source. The preferred sources are fungal 
sources, e.gr. , yeast and filamentous fungi as described supra. 
Preferred filamentous fungal sources include, but are not 
limited to, species of Acremonium, Aspergillus, Fusarium, 
Humicola, Myceliophthora, Mucor, Neurospora, Penicillium, 
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Phanerochaete , Thielavia, Tolypocladium, and Trichoderma. 
Preferred yeast sources include, but are not limited to, 
species of Candida, Hansenula, Kluyveromyces , Pichia, 
Saccharomyces , Schizosaccharomyces , and Yar.rowia. Furthermore, 
5 the nucleic acid sequence may be native to the filamentous 
fungal cell . 

In another preferred embodiment, the parent cell harbours a 
gene having a nucleic acid sequence encoding a polypeptide with 
an amino acid sequence which has at least 50% identity with the 

10 amino acid sequence of SEQ ID NO: 2 or SEQID NO: 49. 

In another preferred embodiment, the parent cell harbours a 
gene having a nucleic acid sequence with at least 70% identity 
with the nucleic acid sequence of SEQ ID NO:l or SEQ ID NO: 48. 
In another preferred embodiment, the mutant cell harbours a 

15 nucleic acid sequence, which has been modified or inactivated by 
any of the methods described above and produces less of a 
protease or a combination of proteases than the parent cell when 
cultured under identical conditions. The mutant cell produces 
preferably at least about 25% less, more preferably at least 

20 about 50% less, even more preferably at least about 75% less, 
and even more preferably at least about 95% less of a protease 
or a combination of proteases than the parent cell when cultured 
under identical conditions. 

In an even more preferred embodiment, the mutant cell 

25 produces essentially undetectable amounts of a protease or 
combination of proteases than the parent cell when cultured 
under identical conditions. 

The protease (s) may be assayed using known methods. In one 
such method, an aliquot of a 48 hour culture media is incubated 

30 with 3 H-labelled sperm whale myoglobin at pH 4.0 and the 
radioactivity in the TCA-soluble fraction is measured (van 
Noort, J.M., et al . , 1991. Anal. Biochem 198:385-390). Other 
methods have been described for identifying, e.g., aspartic 
proteinase A. of A. niger (Takahashi, K. , 1991. Meth. in 
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Enzymol. 248:146-155), endopeptidases (Morihara, K. , 1995. 
Meth. in Enzymol . 248:242-253), carboxypeptidases (Reminton, 
J., and Breddam, K. , 1994. Meth. in Enzymol. 244:231-248), 
dipeptidyl peptidase (Ikehara, Y., et al., 244:215-227), and 
5 aminopeptidases (Little, G. , et al., 1976. Meth. in Enzymol. 
45:495-503) . 

In another preferred embodiment, the mutant cell harbours at 
least one copy of a nucleic acid sequence encoding a polypeptide 
of interest. 

10 Another aspect of the present invention relates to a host 
cell useful for the production of a polypeptide wherein the 
host cell is a mutant of a parent fungal cell in which the 
mutant (a) produces more of the transcriptional activator of 
the present invention as compared to the parent cell when 

is cultured under the same conditions; and (b) comprises a DNA 
sequence encoding the polypeptide, the transcription of which 
is activated by the transcriptional activator. 

In a preferred embodiment, the host cell produces more of 
the transcriptional activator than the parent cell when 

20 cultured under the same conditions by introducing into the 
parent cell one or more copies of (i) a nucleic acid sequence 
encoding a transcriptional activator, (ii) a nucleic acid 
construct comprising a nucleic acid sequence encoding a 
transcriptional activator, or (iii) an expression vector as 

25 defined above in the section "Expression Vectors" . 

The nucleic acid construct comprising a nucleic acid 
sequence encoding a transcriptional activator of the present 
invention may also comprise one or more nucleic acid sequences 
which encode one or more factors that are advantageous for 

30 directing the expression of the polypeptide, e.g., a 
transcriptional activator (e.g., a trans-acting factor), a 
chaperone, and a processing protease. Any factor that is 
functional in the filamentous fungal cell of choice may be used 
in the present invention. The nucleic acids encoding one or 
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more of these factors are not necessarily in tandem with the 
nucleic acid sequence encoding the polypeptide. 

An activator is a protein, which activates transcription of 
a nucleic acid sequence encoding a polypeptide (Kudla et al . , 
5 1990, EMBO Journal 9: 1355-1364; Jarai and Buxton, 1994, 
Current Genetics 26: 2238-244; Verdier, 1990, yeast 6: 271- 
2 97) . The nucleic acid sequence encoding an activator may be 
obtained from the genes encoding Saccharomyces cerevisiae heme 
activator protein 1 (Jiapl) , Saccharomyces cerevisiae galactose 

10 metabolizing protein 4 (gal4) , Aspergillus nidulans ammonia 
regulation protein (areA) , and Aspergillus oryzae alpha- amylase 
activator (amyR) . For further examples, see Verdier, 1990, 
supra and MacKenzie et al,, 1993, Journal of General 
Microbiology 139: 2295-2307. 

15 A chaperone is a protein which assists another polypeptide 
in folding properly (Hartl et al., 1994, TIBS 19: 2 0-25; 
Bergeron et al . , 1994, TIBS 19: 124-12 8; Demolder et al., 1994, 
Journal of Biotechnology 32 : 179-189; Craig, 1993, Science 260: 
1902-1903; Gething and Sambrook, 1992, Nature 355: 33-45; Puig 

20 and Gilbert, 1994, Journal of Biological Chemistry 269: 7764- 
7771; Wang and Tsou, 1993, The FASEB Journal 7: 1515-11157; 
Robinson et al., 1994, Bio /Technology 1: 381-384; Jacobs et 
al., 1993, Molecular Microbiology 8: 957-966). The nucleic 
acid sequence encoding a chaperone may be obtained from the 

25 genes encoding Aspergillus oryzae protein disulphide isomerase 
or Saccharomyces cerevisiae calnexin, Saccharomyces cerevisiae 
BiP/GRP78, and Saccharomyces cerevisiae Hsp70. For further 
examples, see Gething and Sambrook, 1992, supra, and Hartl et 
al., 1994, supra. 

30 A processing protease is a protease that cleaves a 
propeptide to generate a mature biochemically active 
polypeptide (Enderlin and Ogrydziak, 1994, yeast 10: 67-79; 
Fuller et al . , 1989, Proceedings of the National Academy of 
Sciences USA 86: 1434-1438; Julius et al., 1984, Cell 37: 1075- 
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1089; Julius et al . , 1983, Cell 32: 839-852; U.S. Patent No. 
5,702,934). The nucleic acid sequence encoding a processing 
protease may be obtained from the genes encoding Saccharomyces 
cerevisiae dipeptidylaminopeptidase , Saccharomyces cerevisiae 
5 Kex2, Yarrowia lipolytica dibasic processing endoprotease 
(xpr6") , and Fusarium oxysporum metalloprotease (p45 gene) . 

In a more preferred embodiment, the nucleic acid sequence 
encoding the transcriptional activator is operably linked to a 
promoter, or a functional part thereof, which is stronger than 
io the corresponding promoter of the parent cell. In an even more 
preferred embodiment, the promoter, or a functional part 
thereof, mediates the expression of a gene encoding an 
extracellular protease, such as the Aspergillus oryzae alkaline 
protease, A. oryzae neutral metalloprotease, A. niger 
15 aspergillopepsin protease, Fusarium oxysporum trypsin-like 
protease or F. venenatum trypsin. 

The present invention also relates to a host cell useful for 
the production of a polypeptide wherein the host cell is a 
mutant of a parent fungal cell in which the mutant comprises 
20 a) a modification or inactivation of a transcriptional 
activator of the present invention, or a regulatory 
sequence thereof, and 
b) (i) an inducible promoter operably linked to a nucleic 
acid sequence encoding a transcriptional activator of the 
25 present invention, and (ii) a promoter sequence to which 

the transcriptional activator can bind, operably linked 
to a nucleic acid sequence encoding the polypeptide, 
wherein (i) and (ii) can be introduced simultaneously or 
sequentially. 

30 

The inactive form of the transcriptional activator in (a) 
above is obtained by inactivation or modification of a nucleic 
acid sequence present in the cell and necessary for the 
expression of the native transcriptional activator according to 
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any of the methods as disclosed supra. In a preferred 
embodiment the inactivation or modification is obtained by 
methods, which include, but are not limited to, one or more 
nucleotide insertions, deletions or substitutions, specific or 
5 random mutagenesis, gene replacement or gene interruption, and 
anti-sense techniques using a nucleotide sequence complementary 
to the nucleic acid sequence of the transcriptional activator. 
In another preferred embodiment, the inactive form of the 
native transcriptional activator is obtained by inactivation or 

10 modification of a control sequence required for the expression 
of the transcriptional activator. 

In another preferred embodiment, the nucleic acid sequence 
encoding the native transcriptional activator has the sequence 
set forth in SEQ ID NO:l or SEQ ID NO: 48. In another preferred 

15 embodiment, the transcriptional activator comprises the 
polypeptide having the amino acid sequence in SEQ ID NO:3. 

The inducible promoter sequence in (b) above may be any 
promoter sequence, or a functional part thereof, wherein the 
transcription initiation activity of the promoter can be 

20 induced according to the fermentation conditions. Preferably, 
the induction is mediated by a carbon or nitrogen catabolite. 
In a preferred embodiment, the promoter is the amdS promoter of 
Aspergillus nidulans or A. oryzae, the niaD promoter of A. 
nidulans, A. oryzae or A. niger, the niiA promoter of 

25 Aspergillus species, the alkaline phosphatase promoter of 
Aspergillus sp., the acid phosphatase promoter of Aspergillus 
sp., or the alcA promoter of A. niger. 

In another preferred embodiment, the host cell further 
comprises a promoter sequence, wherein the promoter sequence 

30 can be activated by the transcriptional activator and is 
operably linked to the nucleic acid sequence encoding the 
polypeptide. 

The promoter sequence activated by the transcriptional 
activator of the present invention may be any promoter 
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sequence, or a functional part thereof, selected from the group 
which includes but is not limited to promoters obtained from 
the genes encoding Aspergillus oryzae TAKA amylase, Rhizomucor 
miehel aspartic proteinase, Aspergillus niger neutral 
5 alpha- amylase, Aspergillus niger acid stable alpha-amylase, 
Aspergillus niger or Aspergillus awamori glucoamylase iglaA) , 
Rhizomucor miehei lipase, Aspergillus oryzae alkaline protease, 
Aspergillus oryzae triose phosphate isomerase, Aspergillus 
nidulans acetamidase, the NA2-tpi promoter (a hybrid of the 

10 promoters from the genes encoding Aspergillus niger neutral 
alpha-amylase and Aspergillus oryzae triose phosphate 
isomerase) , and mutant, truncated, and hybrid promoters 
thereof. . Particularly preferred promoters for use in 
filamentous fungal cells are a promoter, or a functional part 

15 thereof, from a protease gene; e.g., from the Fusarium 
oxysporum trypsin-like protease gene (U.S. Patent No. 
4,288,627), Aspegillus oryzae alkaline protease gene (alp), 
Aspergillus niger pacA gene, Aspergillus oryzae alkaline 
protease gene, A. oryzae neutral metal loprot ease gene, A. niger 

20 aspergillopepsin protease gene, or F. venenatum trypsin gene. 

In another preferred embodiment, the host cell harbours at 
least one copy of a nucleic acid sequence encoding a 
polypeptide . 

In another preferred embodiment, the host cell, which 
25 expresses the transcriptional activator of the present 
invention produces less of one or more native proteases than 
the parent cell when cultured under identical conditions. The 
protease (s) may be assayed using any of the methods described 
above. In a more preferred embodiment, an aliquot from a 48- 
30 hour culture media is incubated with 3 H-labelled sperm whale 
myoglobin at pH 4.0 and the radioactivity in the TCA-soluble 
fraction is measured (van Noort, J.M. , et al . , supra). 

The nucleic acid constructs described herein may be 
introduced into a parent fungal cell according to any of the 
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methods as described supra in the section, "Host Cells" to 
obtain a host cell useful for the production of a polypeptide. 
In a preferred embodiment the nucleic acid construct is 
integrated into the chromosome of the cell . In another 
5 preferred embodiment the nucleic acid construct is maintained 
as a self -replicating extra- chromosomal vector. 

It will be understood that the methods of the present 
invention are not limited to a particular order for obtaining 
the mutant fungal cell. The modification of the second nucleic 
10 acid sequence may be introduced into the parent cell at any 
step in the construction of the cell for the production of a 
polypeptide. 

Producing a Polypeptide 

15 Another aspect of the present invention relates to methods of 
producing a polypeptide in a host cell of the present invention, 
comprising: (a) cultivating the host cell which harbours a gene 
encoding the polypeptide in a nutrient medium suitable for 
production of the polypeptide; and (b) recovering the 

20 polypeptide from the nutrient medium of the host cell. 

In one embodiment, the host cell which is a mutant of a 
parent fungal cell in which the parent cell comprises one or 
more nucleic acid sequences encoding a protease, the 
transcription of which is activated by a transcriptional 

25 activator of the present invention, and the mutant cell 
produces less of the transcriptional activator and the 
protease (s) than the parent cell when cultured under the same 
conditions . 

In another embodiment, the host cell is a mutant of a parent 
30 fungal cell in which the mutant (a) produces more of the 
transcriptional activator of the present invention as compared 
to the parent cell when cultured under the same conditions; and 
(b) comprises a DNA sequence encoding the polypeptide, the 
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transcription of which is activated by the transcriptional 
activator. 

In another embodiment, the host cell is a mutant of a parent 
fungal cell in which the mutant comprises (a) a modification or 
5 inactivation of a transcriptional activator of the present 
invention or a regulatory sequence thereof, and (b) an 
inducible promoter operably linked to a nucleic acid sequence 
encoding a transcriptional activator of the present invention 
and a promoter sequence to which the transcriptional activator 

10 can bind, operably linked to a nucleic acid sequence encoding 
the polypeptide, wherein (i) and (ii) can be introduced 
simultaneously or sequentially. 

The host cells of the present invention are cultivated in a 
nutrient medium suitable for production of the polypeptide of 

15 interest using methods known in the art. For example, the 
cells may be cultivated by shake flask cultivation, small-scale 
or large-scale fermentation (including continuous, batch, fed- 
batch, or solid state fermentations) in laboratory or 
industrial fermentors performed in a suitable medium and under 

20 conditions allowing the polypeptide to be expressed and/or 
isolated. The cultivation takes place in a suitable nutrient 
medium comprising carbon and nitrogen sources and inorganic 
salts, using procedures known in the art (see, e.g., Bennett, 
J.W. and LaSure, L . , eds., More Gene Manipulations in Fungi, 

25 Academic Press, CA, 1991) . Suitable media are available from 
commercial suppliers or may be prepared using published 
compositions (e.g., in catalogues of the American Type Culture 
Collection) . If the polypeptide is secreted into the nutrient 
medium, the polypeptide can be recovered directly from the 

30 medium. If the polypeptide is not secreted, it is recovered 
from cell lysates. 

The resulting polypeptide may be isolated by methods known 
in the art. For example, the polypeptide may be isolated from 
the nutrient medium by conventional procedures including, but 
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not limited to, centrifugation, filtration, extraction, spray- 
drying, evaporation, or precipitation. The isolated 
polypeptide may then be further purified by a variety of 
procedures known in the art including, but not limited to, 
5 chromatography (e.g., ion exchange, affinity, hydrophobic, 
chromatofocusing, and size exclusion) , electrophoretic 
procedures (e.g., preparative isoelectric focusing, 
differential solubility (e.g., ammonium sulfate precipitation), 
or extraction (see, e.g., Protein Purification, J.-C. Janson 

10 and Lars Ryden, editors, VCH Publishers, New York, 1989) . 

The polypeptide may be detected using methods known in the 
art that are specific for the polypeptide. These detection 
methods may include use of specific antibodies, formation of an 
enzyme product, disappearance of an enzyme substrate, or SDS- 

15 PAGE. For example, an enzyme assay may be used to determine 
the activity of the polypeptide. Procedures for determining 
enzyme activity are known in the art for many enzymes. 

In the methods of the present invention, the host cell 
produces at least about 20% more, preferably at least about 50% 

20 more, more preferably at least about 100% more, even more 
preferably at least about 200% more, and most preferably at 
least about 3 00% more of the polypeptide than a corresponding 
parent cell when cultivated under the same conditions. 

The polypeptide may be any polypeptide whether native or 

25 heterologous to the mutant filamentous fungal cell. The term 
"heterologous polypeptide'' is defined herein as a polypeptide, 
which is not produced by a cell. The term "polypeptide" is not 
meant herein to refer to a specific length of the encoded 
produce and therefore encompasses peptides, oligopeptides and 

30 proteins. The polypeptide may also be a recombinant 

polypeptide, which is a polypeptide native to a cell, which is 
encoded by a nucleic acid sequence, which comprises one or more 
control sequences, foreign to the nucleic acid sequence, which 
are involved in the production of the polypeptide. The 
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polypeptide may be a wild-type polypeptide or a variant 
thereof. The polypeptide may also be a hybrid polypeptide, 
which contains a combination of partial or complete polypeptide 
sequences obtained from at least two different polypeptides 
5 where one or more of the polypeptides may be heterologous to 
the cell. Polypeptides further include naturally occurring 
allelic and engineered variations of the above-mentioned 
polypeptides . 

In a preferred embodiment, the polypeptide is an antibody or 
10 portions thereof, an antigen, a clotting factor, an enzyme, a 
hormone or a hormone variant, a receptor or portions thereof, a 
regulatory protein, a structural protein, a reporter, or a 
transport protein. 

In a more preferred embodiment, the enzyme is an 
15 oxidoreductase, transferase, hydrolase, lyase, isomerase, or 
ligase . 

In an even more preferred embodiment, the enzyme is an 
aminopeptidase , amylase , carbohydrase , carboxypept idase , 
catalase, cellulase, chitinase, cutinase, deoxyribonuclease, 

20 dextranase, esterase, alpha-galactosidase, beta-galactosidase, 
glucoamylase , alpha-glucosidase , beta-glucosidase , 

haloperoxidase, invertase, laccase, lipase, mannosidase, 
mutanase, oxidase, pectinolytic enzyme, peroxidase, phytase, 
polyphenoloxidase, proteolytic enzyme, ribonuclease, 

25 transglutaminase, or xylanase. 

In another even more preferred embodiment, the polypeptide 
is human insulin or an analog thereof, human growth hormone, 
erythropoietin, or insulinotropin. 

The nucleic acid sequence encoding a heterologous 

30 polypeptide may be obtained from any prokaryotic, eukaryotic, 
or other source. For purposes of the present invention, the 
term "obtained f rom" as used herein in connection with a given 
source shall mean that the polypeptide is produced by the 
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source or by a cell in which a gene from the source has been 
inserted. 

In the methods of the present invention, the mutant 
filamentous fungal cells may also be used for the recombinant 
5 production of polypeptides, which are native to the cell. The 
native polypeptides may be recombinantly produced by, e.g., 
placing a gene encoding the polypeptide under the control of a 
different promoter to enhance expression of the polypeptide, to 
expedite export of a native polypeptide of interest outside the 

10 cell by use of a signal sequence, and to increase the copy 
number of a gene encoding the polypeptide normally produced by 
the cell. The present invention also encompasses, within the 
scope of the term "heterologous polypeptide" , such recombinant 
production of polypeptides native to the cell, to the extent 

15 that such expression involves the use of genetic elements not 
native to the cell, or use of native elements which have been 
manipulated to function in a manner that do not normally occur 
in the filamentous fungal cell. The techniques used to isolate 
or clone a nucleic acid sequence encoding a heterologous 

20 polypeptide are known in the art and include isolation from 
genomic DNA, preparation from cDNA, or a combination thereof. 
The cloning of the nucleic acid sequences from such genomic DNA 
can be effected, e.g., by using the well known polymerase chain 
reaction (PCR) , see, for example, Innis et al. t 1990, PCR 

25 Protocols: A Guide to Methods and Application, Academic Press, 
New York. The cloning procedures may involve excision and 
isolation of a desired nucleic acid fragment comprising the 
nucleic acid sequence encoding the polypeptide, insertion of 
the fragment into a vector molecule, and incorporation of the 

30 recombinant vector into the mutant fungal cell where multiple 
copies or clones of the nucleic acid sequence will be 
replicated. The nucleic acid sequence may be of genomic, cDNA, 
RNA, semisynthetic, synthetic origin, or any combinations 
thereof . 
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In the methods of the present invention, heterologous 
polypeptides may also include fused or hybrid polypeptides in 
which another polypeptide is fused at the N-terminus or the C- 
terminus of the polypeptide or fragment thereof. A fused 
5 polypeptide is produced by fusing a nucleic acid sequence (or a 
portion thereof) encoding one polypeptide to a nucleic acid 
sequence (or a portion thereof) encoding another polypeptide. 
Techniques for producing fusion polypeptides are known in the 
art, and include, ligating the coding sequences encoding the 

10 polypeptides, so that they are in frame and expression of the 
fused polypeptide is under control of the same promoter (s) and 
terminator. The hybrid polypeptides may comprise a combination 
of partial or complete polypeptide sequences obtained from at 
least two different polypeptides wherein one or more may be 

15 heterologous to the mutant fungal cell. An isolated nucleic 
acid sequence encoding a heterologous polypeptide of interest 
may be manipulated in a variety of ways to provide for 
expression of the polypeptide. Expression will be understood 
to include any step involved in the production of the 

20 polypeptide including, but not limited to, transcription, post- 
transcriptional modification, translation, post-translational 
modification, and secretion. Manipulation of the nucleic acid 
sequence encoding a polypeptide prior to its insertion into a 
vector may be desirable or necessary depending on the 

25 expression vector. The techniques for modifying nucleic acid 
sequences utilizing cloning methods are well known in the art. 

The present invention is further described by the following 
examples, which should not be construed as limiting the scope 
of the invention. 
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EXAMPLES 
Materials 

Chemicals used as buffers and substrates were commercial 
5 products of at least reagent grade. 

Strains 

AB4.1: a strain of Aspergillus niger which is a cspAl pyrGl 
derivative of strain ATTC 9029 (van Hartingsveldt , W. , et al., 

10 1987. Mol. Gen. Genet. 206:71-75; Bos, C.J., et al., Curr. 
Genet. 14:437-443) 

AB1.13: a protease deficient strain of Aspergillus niger 
derived from UV mutagenesis of AB4.1 (Mattern, I.E., et al., 
1992. Mol. Gen. Genet. 234:332-336) 

15 13PAP2: an AB1.13 derivative containing multiple copies of 
the A. nidulans amdS gene (Corrick, R. A. , et al, 1987. Gene 
53: 63 - 71) under control of the pepA promoter of A. niger 
(Jarai G. and Buxton F. 1994. Curr Genet 26:238-244). The 
strain has a protease deficient phenotype and is unable to grow 

20 on medium containing acetamide as the sole nitrogen source. 
Strain 13PAP2 has been deposited at DSM under the name DSM No. 
12298. 

4PAP6: an AB4 . 1 derivative containing multiple copies of the 
of A. nidulans amdS gene under control of the pepA promoter of 
25 A. niger. The strain does not have a protease deficient pheno- 
type and is able to grow on medium containing acetamide as the 
sole nitrogen source. 

N402: a strain of Aspergillus niger, deposited at the ATCC 
(Manassas VA, USA) as ATCC Number: 64974 
30 MC1046: a strain of E. coli, deposited at the ATCC as ATCC 
Number: 35467 

A.oryzae IF04177: available from Institute for Fermenta- 
tion, Osaka; 17-25 Juso Hammachi 2-Chome Yodogawa-Ku, Osaka, 
Japan . 



WO 01/68864 



PCT/DK01/00169 



43 

HowBlOl: described in WO 97/35956. 
Plasmids 

pPAP: constructed as described below in Example 1 and 
5 shown in Figure 1 

pAopyrGcosArpl : constructed as described below in Exam- 

ple 1 and shown in Figure 2 

pEESl: constructed as described below in Example 1 and 
shown in Figure 3 
io p3SR2: contains the A. nidulans amdS gene as described by 
CM. Corrick, A. P. Twomey, and M.J. Hynes (1987. Gene 53: 63- 
71) 

pABPYRG*-Not : contains an inactivated pyrG gene as 

described by Verdoes, J.C., et al . (1994. Gene 145: 179-187) 
15 pHelpl : contains the pyrG gene from A. oryzae as a selec- 
tive marker and the AMA1 sequences which enable autonomous rep- 
lication in A. niger, cloned into the E. coli vector pIC20R, as 
described by Gems, D., et al. (1991. Gene 98: 61-67) 

pAnscosl : contains two cos sites as described by 

20 Osiewacz, H.D. (1994. Curr. Genet. 26: 87-90) 

pA04-2: contains the A, oryzae pyrG gene as described by De 
Ruiter- Jacobs, Y.M.J.T., et al. (1989. Curr. Genet. 16: 159- 
163) 

pA04-13: contains the A. oryzae pyrG gene as described by De 
25 Ruiter- Jacobs, Y.M.J.T., et al . (1989. Curr. Genet. 16:159- 
163) 

pUC19: as described by Yanisch-Perron, C, Vieira, J. and 

Messig, J. (1985, Gene 33:103-119) 

pDV8: described in Example 8 and shown in Fig. 7. 
30 pJaL554: described in Example 8 and shown in Fig. 8 

Deposit of Biological Materials 

The following biological material has been deposited under 
the terms of the Budapest Treaty with the Deutsche Sammlung von 
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Microorganismen und Zellkulturen GmbH, Mascheroder Weg lb, D- 

38124 Braunschweig, Germany and given the following accession 
number : 

Deposit Accession Number Date of Deposit 

5 Escherichia coli f pEES DSM 12294 1998-07-14 

Aspergillus niger 13PAP2 DSM 12298 1998-07-14 

The strain has been deposited under conditions that assure 
that access to the culture will be available during the 

10 pendency of this patent application. The deposit represents a 
substantially pure culture of the deposited strain. The 
deposit is available as required by foreign patent laws in 
countries wherein counterparts of the subject application, or 
its progeny are filed. However, it should be understood that 

15 the availability of a deposit does not constitute a license to 
practice the subject invention in derogation of patent rights 
granted by governmental action. 

EXAMPLES 

20 

Example 1 

Cloning of the A. niger prtT transcriptional activator 

The prtT gene was cloned from 13PAP2, an A. niger mutant strain 
which is unable to express the amdS gene regulated by the pepA 
25 protease gene promoter and has a protease deficient phenotype 
(prf) . 

Construction of the A. niger 13PAP2 reporter strain 

The plasmid pPAl was contructed by ligation of the following 
30 three fragments: 

1) the E. coli vector pBlueScript II SK (Stratagene Cloning 
Systems, La Jolla CA, USA) digested with EcoRI and Kpnl; 

2) a 1.4 kb EcoRI /BamHI restriction fragment containing the 
1.2 kb promoter region of the pepA gene linked to about 
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130 bp of the amdS coding sequence from the start codon 
to an internal BamHI site, amplified by PCR; and 
3) a 2.1 kb BamHl/Kpnl fragment from p3SR2 which contains 
most of the A. nidulans amdS gene 

5 

Fragment 2 was constructed in two steps. In a first step 
genomic DNA from A. niger N402 prepared from protoplasts as de- 
scribed below in the section "Construction of the Cosmid Li- 
brary" was used as the template, and the two oligonucleotides 
10 shown below, pepApr and pepA/amdS, were used as primers: 

PepApr: CGG AAT TCG GAT GCT GGA GGT GCT TCT AA (SEQ ID NO: 6) 

pepA/amdS:TTC CCA GGA TTG AGG CAT TTT GAC CAC GAG AAT (SEQ ID NO: 

15 

The 12 00 bp PCR product obtained from this reaction was then 
used as a primer in a second PCR reaction together with the 
oligonucleotide MBL1213 shown below, and plasmid p3SR2 as the 
template. 

20 

MBL1213: TAA CTT CCA CCG AGG TC (SEQ ID NO: 8) 

The product obtained by ligation of the three fragments de- 
scribed above was subsequently transfected into E. coli DH5a. 

25 In the final construction procedure, pPAl was digested with 
Not I and ligated to a 3.8 kb'Notl fragment from pABPYRG* -Not , 
resulting in plasmid pPAP which is shown in Fig. 1. pPAP was 
transformed into A. niger AB 1.13, and a transformant with pPAP 
integrated into the pyrG locus in multicopy was isolated. A 

30 spontanous 5 flourotic acid (FOA) resistant, uridine -requiring 
mutant of this transformant that could be complemented with the 
pyrG gene was named 13 PAP 2. 
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Construction of pAopyrGcosArpl 

The plasmid pAopyrGcosArpl was constructed by ligation and 
subsequent transfection into E. coli DH5a of the following 
three fragments: 
5 1) the E. coli vector pHelpl cut with Acc65I and BamHI 

2) a 3.0 kb BamHI/Hindlll fragment from pAnscosl containing 
two cos sites 

3) a 3.2 kb Acc65I/HindIII fragment from pA04-2 containing 
the A. oryzae pyrG gene 

10 

The resulting plasmid, pAopyrGcosArpl, is self -replicating 
in Aspergilli and can be selected for by growth on medium lack- 
ing uridine. pAopyrGcosArpl is depicted in Fig. 2. 

15 Construction of the cosmid library. 

A cosmid library of Aspergillus niger was constructed using 
the "SuperCosl cosmid vector kit" (Stratagene Cloning Systems, 
La Jolla CA, USA) according to the supplier's instructions. 

Genomic DNA from A. niger N402 was prepared from protoplasts 

20 made by standard procedures. 

After isolation the protoplasts were pelleted by centri- 
fugation at 2000 rpm for 10 minutes in a Beckman GS-6R; the 
pellet was then suspended in a buffer containing 22.5 mM tri- 
isonaphtalene sulphonic acid, 275 mM para-aminosalicylic acid, 

25 0.2 M Tris-HCl (pH 8.5), 0.25 M NaCl and 50 mM EDTA immediately 
followed by addition of 1 volume of phenol /chloroform (1:1). 
After careful mixing and centrifugation at 3000 rpm for 2 0 min- 
utes the aqueous phase was decanted and DNA was precipitated 
using standard procedures . 

3 0 The size of the genomic DNA was analysed by electrophoresis 
on a 0.3% agarose gel run for 20 hours at 3 0 volts at 4°C. The 
ethidium bromide stained gel showed that the recovered DNA 
ranged in size from 50 to greater than 100 kb. The DNA was 
partially digested using Mbol. The size of the digested DNA was 
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30 to 50 kb as determined by the same type of gel analysis as 
above. The pAopyrGcosArpl vector, purified using a kit from 
QIAGEN (Venlo, The Netherlands) following the manufacturer's 
instructions, was digested with BamHI, dephosphorylated and gel 
5 purified. Ligation and packaging were - performed following 
standard procedures . 

After titration of the library, all of the packaging mix 
from a single ligation and packaging was transfected into the 
host cell, MC1046, and plated on 50 fig/ml ampicillin LB plates. 

10 Approximately 40,000 colonies were obtained. Cosmid prepara- 
tions from 10 colonies showed that they all had inserts of the 
expected size. The 40,000 colonies were then soaked in LB me- 
dium and scraped off of the plates, then aliquoted for storage 
in 15% glycerol at -80°C. This represents an approximate 40- 

15 fold amplification of the A. niger genome. 

Selection of A. niger prtT clones 

Cosmid DNA was prepared from the library and introduced into 
13PAP2 according to the transformation procedure described by 

20 P.J. Punt and C.A.M. J. J . Van Den Hondel (1992. Methods Enzymol 
216: 447-457). Repeated efforts to select for the pyrG marker 
only resulted in a recovery of between 4000 to 3 0,000 transfor- 
mants. A double selection for the pyrG marker and growth on 
medium containing acetamide as the sole nitrogen source re- 

25 suited in a total of 65 primary transf ormants from five differ- 
ent experiments. 

Each primary trans formant was screened for protease activ- 
ity, growth on medium containing acetamide as the sole nitrogen 
source and instability of the these two characteristics. An 

30 acetamidase + phenotype, screened by growth on medium containing 
acetamide, is an indication of acetamidase activity resulting 
from activation of the pepA promoter in the reporter cassette 
in which the pepA promoter is linked to the amdS coding se- 
quence. A protease + phenotype was screened using minimal me- 
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dium plates containing ciialyzed skim milk as the sole nitrogen 
source (Mattern, I.E., etal., 1992, Mol Gen Genet 234:332- 
336). On these plates the wild-type AB4.1 strain makes a clear 
halo whereas the AB1.13 mutant produces a very small halo. This 
5 difference is not due to differences in the activity of pepA 
since a pepA deleted strain can also produce a . large halo on 
these plates. Therefore, a large halo on milk plates indicates 
activation of other extracellular proteases. 

Instability was tested by growing diluted spore stocks on a 

10 medium containing uridine. Single-spore-derived colonies were 
•picked from these plates and tested for protease activity and 
growth on acetamide. The screening results revealed that in 
more than 70% of the colonies both characteristics were lost. 
Therefore, the two phenotypes were either lost or retained to- 

15 gether, indicating that activation of the pepA promoter and 
other protease promoters is coordinately regulated and linked 
to the presence of the pyrG marker. The gene responsible for 
this phenotype was named prtT. Twelve acetamidase*, protease* 
transformants were then isolated. 

20 

Isolation of the A.niger prtT gene 

In order to rescue the prtT gene from the acetamidase + , pro- 
tease 4 transformants of 13PAP2, DNA was prepared from mycelium 
grown in minimal medium as previously described. This DNA was 

25 used in an attempt to transform competent E. coli DHSoc cells. 
Several hundreds of ampicillin-resistant colonies were ob- 
tained. DNA analysis showed they all contained sequences de- 
rived from the pHelpl plasmid. Cosmid DNA isolated from E. 
coli colonies was then retransf ormed into 13PAP2 . Two DNA sam- 

30 pies gave rise to transformants, which showed both growths on 
acetamide containing medium and increased protease activity. 
DNA from one of the cosmids, ACR1, was then digested with sev- 
eral restriction enzymes. The resulting fragments were then 
co-transformed with pAopyrGcosArpl into strain 13PAP2. EcoRI, 
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PstI, BamHI and Kpnl digestion of ACR1 gave rise to transfor- 
mants capable of growth on acetamide and high protease activ- 
ity, whereas Sail and Hindlll digests did not. Because EcoRI 
digestion gave the simplest pattern, separate EcoRI fragments 
5 were gel-isolated and with pAopyrGcosArpl used to co-transform 
13PAP2. Only one fragment, a 15 kb EcoRI fragment, gave rise 
to transformants capable of growth on acetamide-containing me- 
dium: This fragment was subcloned in pBluescript II SK in or- 
der to subclone prtT from the cosmid. Since the insert of this 

10 clone was still rather large, separate PstI bands were gel iso- 
lated and each was co- transformed . with pAopyrGcosArpl into 
13PAP2. Only one band, a 2.5 kb PstI fragment, gave rise to 
transformants that could grow on acetamide-containing medium. 
This fragment was subcloned in pBlueScript II SK. Four sub- 

15 clones, ClE 0.7, ClE 1.8, NcE 1.1 and NcE 1.4, were constructed 
from this plasmid based on the restriction map. In addition, a 
6.5 kb Sstl/EcoRI fragment encompassing the 2.5 kb PstI frag- 
ment was subcloned, resulting in pEESl (shown in Fig. 3) . 

Southern blot analysis of genomic DNA from AB4 . 1 showed the 

20 presence of only one copy of prtT. 

Example 2 

Sequencing of the A. niger prtT gene and analysis of the se- 
quence 

25 All sequence reactions were prepared using dRhodamine Termi- 
nator Cycle Sequencing Kits or BigDye™ Terminator Cycle Se- 
quencing Kits from the Perkin-Elmer Corporation (Branchburg NJ, 
USA) . The reactions were run on an ABI PRISM* 3 77 DNA Se- 
quencer (Perkin-Elmer Corporation) following the manufacturer's 

30 instructions. 

The prtT gene .was sequenced from the genomic clones ClE 0.7, 
ClE 1.8, NcE 1.1, NcE 1.4 and pEESl. The sequence specific 
primers used are listed below: 

122958: CGA TCG ATG ACT GCC TGT (SEQ ID NO: 9) 
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122956 : 


AGA 


GAC 


ACA 


TAG 


TGP 


PTT 

v- X X 


V O Lli\J 1JJ 1NW . 


xu ; 




122959 : 


GCT 


TAT 


AGT 


CGA 


TAG 


pftp 


f QPO TD TvTO • 

\OH\^ JLU 1M\J , 


xx ; 




122960 • 


CCT 


CTC 


TCC 


AGP 


GAT 

vj.ri X 


VJO X 


( QTTn T"n TvTO » 
V OCiy 1JJ IMVJ ; 


xz ; 




122962 • 

-1» fcJ £j »✓ W ■ 


ATG 


GAA 


TAP 


ATA 


PTG 


PTT 


voJzjv.j xu : 


x j ; 


5 


122961 : 


ATG 


AAA 


ppp 


APT 




nr*T 

vjLX 


\ oxls^ xu : 






122963 : 


TGP 


TOG 


ATA 


AGP 




TPP 


(SEQ ID NO: 


15) 




122964 : 


AAT 


CTT 


ATG 


GAP 


PPG 


PTT 


(SEQ ID NO: 


16) 




124289 : 


CCC 


CGG 


GAA 


ACA 


AGA 


APA 


GG (SEQ ID 


NO: 17) 




124290 : 


GTT 


GGC 


GGA 


CCT 


TGA 


PTA 


TG (SEQ ID 


NO: 18) 


XU 


xz d x xz : 




LjU 1 


ALA 


UxCj 




TTC 


ATC T (SEQ 


ID NO: 19) 




125111: 


AGT 


CAA 


CGG 


GGG 


AAG 


TCT 


C (SEQ ID NO: 20) 




128330: 


CTA 


GCA 


GCG 


TAT 


CGG 


TCA 


GC (SEQ ID 


NO: 21) 




130887: 


CTT 


GGA 


AAA 


GAA 


ACG 


ATA 


G (SEQ ID NO: 22) 




130888 : 


AAC 


GTA 


CGC 


TTT 


CCT 


CCT 


T (SEQ ID NO: 23) 


15 


134135: 


GGG 


TCC 


GTC 


CAG 


TCC 


GTT 


CTT (SEQ ID 


NO: 24) 



-48 reverse: ' AGC GGA TAA CAA TTT CAC ACA GGA (SEQ ID. NO: 

25) 

-40 universal: GTT TTC CCA GTC ACG AC (SEQ ID NO: 26) 

20 

A mutant allele of the gene was obtained by PCR amplifica- 
tion of genomic DNA isolated from the mutant strain AB1.13 us- 
ing the following primers: 

25 PstI: TC ATC CCT GGT GTT ACT GC (SEQ ID NO: 27) 

Pstll: C ATG GAT TGG CTG GCC G (SEQ ID NO: 28) 

The complete DNA sequence of the prtT gene is shown in SEQ 
ID N0:1. The sequence of the PCR fragment of the mutant allele 
30 is shown in SEQ ID NO:4. 

Analysing the DNA sequence SEQ ID NO:l using the computer 
software Net gene 2 (S.M. Hebsgaard, P.G. Korning, N. Tolstrup, 
J. Engelbrecht, P. Rouze, S. Brunak (1996. Nucleic Acids Re- 
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search 24: 3439-3452) suggested the existence of 5 exons (see 
annotations to SEQ ID NO 1) . 

Analysis of the A. niger prtT cDNA 

5 mRNA was purified from total RNA (isolated according to the 
DNA isolation method described above in Example 1) using a com- 
mercial poly (A) + RNA isolation kit (Pharmacia, Uppsala SE) from 
a culture of A. niger grown under conditions favourable for 
protease production (J.P.T.W. Van Den Hombergh, et al., 1997. 
10 Eur. J. Biochem. 247:605-613). Double stranded cDNA was pre- 
pared using standard procedures and used for PCR reactions with 
the following primers: 

oligo-dT primer: T20N 
15 Prt270n: TACTCTCCAGATTGCCTG (SEQ ID NO: 2 9) 

Prtl420r: TGAGATACCACTCAGCAG (SEQ ID NO: 30) 

prtl350n: TGCACTTCTCTGTCTCTG (SEQ ID NO : 31) 

Prt2365r: GACTTCTGGCATCAGTTG (SEQ ID NO*: 32) 

prt232 0n: CTCATGGATGGCATGATC (SEQ ID NO: 33) 

20 

A PCR reaction with the primers Prt270n and Prtl420r pro- 
duced a fragment of approximately 1.0 kb. The fragment was 
cloned into a pGEM-T vector (Promega Corp., Madison WI, USA), 
and the insert in the resulting plasmid was sequenced using the 
25 primers 122958, 122960, -40 universal and -48 reverse. The re- 
sult confirmed the presence of two introns in this part of the 
gene. 

A second PCR reaction with the primers Prtl350n and Prt2365r 
produced a fragment of approximately 0.9 kb. This fragment was 
30 also cloned in a pGEM-T vector, and the insert in the resulting 
plasmid was sequenced using the primers 124289, 124290, -40 
universal and -48 reverse. The result confirmed the presence 
of a single .intron in this part of the gene. 
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Another PCR reaction with the oligo-dT primer and primer 
Prt2320n produced a fragment of approximately 350 bp. This 
fragment was also cloned in a pGEM-T vector. Sequencing of the 
insert using primers -40 universal and -48 reverse showed that 
5 the fragment contained the 3' part of prtT and confirmed the 
presence of another intron. 

The deduced protein sequence of the translated prtT gene is 
shown in SEQ ID NO:2. The deduced protein sequence of the 
translated mutant allele prtl3 is shown in SEQ ID NO: 5. A com- 

10 parison of SEQ ID NO: 2 and SEQ ID NO: 5 indicates that the only 
difference between the two is in position 112 where the leucine 
residue in the translated prtT gene is replaced by ■ proline in 
the translated prtl3 gene. 

Analysis of the deduced PrtT protein sequence reveals the 

15 presence of a Zinc (II) 2Cys6 binuclear cluster DNA binding motif 
(SEQ ID NO:2, residues 47-81). This motif defines the GAL4 
class of fungal transcriptional activators (Reece, M. J., and 
Ptashne, M. 1993. Science 261: 909-911). The presence of the 
motif in the prtT gene strongly indicates that prtT is a tran- 

20 script ional activator. 

EXAMPLE 3: 

Disruption of the prtT gene in a wild- type A. niger strain 

A plasmid was constructed in which the upstream and down- 
25 stream sequences of the prtT gene are separated by the A. 
oryzae pyrG gene. Plasmid pEESl was digested with Muni and 
Nhel, which removed a 2.1 kb fragment containing most of the 
coding sequence of prtT. A 2.3 kb EcoRI/Nhel fragment from 
pA04-13 containing the A. oryzae pyrG gene was cloned in the 
30 Muni and Nhel sites of pEESl. The resulting plasmid, shown in 
Pig. 4, was named pDprt. This construct was then used to 

transform A. nigrer strain AB4 . 1 to uridine prototrophy. About 
150 uridine prototrophic transformants were then analyzed for 
protease activity on skim milk containing plates. Five of 
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these did not make a halo on these plates indicating that pro- 
tease activity was very low. Comparison of strains with a dis- 
rupted prtT gene and the mutant AB1.13 strain did not show any 
differences in protease activity or phenotype. 

5 

Example 4 

Overexpression of A, niger PrtT 

A plasmid, pGPprt, (Figure 5) containing the coding region 
10 and 3 1 noncoding sequences of prtT fused to the promoter of the 
A. niger gpd gene was constructed. The gpd gene codes for 
glyceraldehyde- 3 -phosphate dehydrogenase and a constitutively 
expressed enzyme involved in primary metabolism. The promoter 
used was a fragment upstream of the coding region. 
15 The plasmid is transformed into A. niger AB4 . 1 by cotransf orma- 
tion with the pyrG selection plasmid pA04-13. Transf ormants 
with increased prtT transcription as determined by Southern 
blot analysis is analysed for increased protease expression. 

20 Example 5 

Isolation of the Zn 2+ - finger from the A. oryzae prtT gene 

The A. niger prtT gene is shown in SEQ ID NO: 1. The pro- 
tein sequence deduced from the DNA sequence of prtT (SEQ ID 
NO: 2) contains a so called Zn 2+ - finger motif expected to be 

25 responsible for the DNA binding of the transcriptional activa- 
tor encoded by prtT. The Zn 2+ -finger motif has the following 
amino acid sequence: Met Thr Ala Cys His Thr Cys Arg Lys Leu 
Lys Thr Arg Cys Asp Leu Asp Pro Arg Gly His Ala Cys Arg Arg Cys 
Leu Ser Leu Arg lie Asp Cys (SEQ ID NO: 34) . 

30 Degenerate primers able to code for amino acid sequences 

from the motif were designed and synthesized by DNA Technology 
A/S, Forskerparken, Gustav Wieds vej 10, DK-8000 Aarhus C, Den- 
mark. The primers had the following sequences: 
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137396: A T G A C C/T G C C/T T G C/T C A C/T A C C/T T G (SEQ 
ID NO: 35) 

137397: A A/G A/G C A A/G/C/T C G A/G/C/T C G A/G C A A/G G C 
A/G T G (SEQ ID NO: 36) 
5 The primers were used in a PCR reaction with genomic A. 

oryzae IF04177 DNA as template. The reaction was performed in a 
total volume of 100 \il containing 154 pmol of primer 137396 and 
10164 pmol of primer 13 73 97. 3 0 PCR cycles with 56°C as anneal- 
ing temperature and 30 seconds elongation time were run. An- 
10 other PCR reaction using A. niger genomic DNA and the primers 
137394: ATGACTGCCTGTCACACATG (SEQ ID NO: 37) ' 

and 137395: AGACAGCGACGGCACGCATG (SEQ ID NO: 38), which are 
specific for the A. niger prtT gene, was also run. In this re- 
action 10 pmol of each primers was used in a 100 fil reaction. 

15 Aliquots of the two reactions were applied to a 3% agarose gel. 
After electrophoresis three approximately equally intense bands 
could be seen in the A. oryzae reaction and two bands in the A. 
niger reaction. One of the bands in the A. niger reaction was 
more intense than the other and further had the expected size. 

20 One of the A. oryzae bands had the same size as the most in- 
tense A. niger band and was isolated from the gel. The fragment 
was cloned into the vector pCR2.1 (Invitrogen™) . Plasmids from 
two individual colonies were sequenced. The sequences are shown 
in figure 1. The two sequences differ at the end reflecting 

25 their origin in different degenerate primers. They are identi- 
cal in the middle 40bp, which are amplified from the genomic 
DNA. These 4 0 basepairs encode a polypeptide identical to a 
part of the Zn 2+ -finger of the A. niger prtT gene. 

30 Example 6 

Isolation of the N- terminal of the A. oryzae prtT gene 

The inverse PCR method was used to isolate the A. oryzae 
prtT gene. The primers 144428: CACCGAGTTTTAAGCTTGCGG (SEQ ID 
NO: 39) and 144429: GCGATCTTGATCCACGAGGG (SEQ ID NO: 40) were 
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synthezised by DNA Technology A/S (Denmark) . Genomic DNA was 
cut with a number of restriction enzymes and religated. The 
ligation mixtures were used as templates in PCR reactions with 
the primers 144428/144429, In a reaction with BamHI restricted 
5 and religated DNA as template a fragment of approximately 2.5kb 
was observed after electrophoresis on an agarose gel. The frag- 
ment was labelled with 32 P by the random priming method and 
used as a probe against a filter containing a gridded cosmid 
library of genomic A. oryzae DNA. The construction of the li- 

10 brary is described in WO 98/01470. The cosmid 11P8 showed a 
positive hybridization signal with the probe. A Southern blot 
containing DNA from 11F8 and genomic DNA restricted with BamHI, 
EcoRI, PstI or Xhol was probed with the 2.5kb inverse PCR frag- 
ment. The size of hybridizing bands from genomic DNA were com- 

15 pared with those from the cosmid DNA. Apparently some rear- 
rangement of the cosmid had occurred since only a minority of 
the bands from the genomic DNA had counterparts in the cosmid. 
Two hybridizing fragments from the cosmid, a 1.2 kb EcoRI frag- 
ment and a 1.0 kb PstI fragment, looked equal in size to hy- 

20 bridizing genomic fragments. The two fragments were sub-cloned 
from the cosmid and sequenced. Analysis of the sequence data 
showed that the fragments overlap. In total 1497 bp of sequence 
was obtained. Oligonucleotides encoding the Zn 2+ - finger were 
not contained within the sequence. A BamHI site was found close 

25 to one end of the sequence in a region only covered by the 
EcoRI sub-clone, thus allowing the position of the sequenced 
genomic fragment relative to the Zn 2+ -finger to be determined. 
The primer 153468: CGGGATGAATTGTAGAGAGGC (SEQ ID NO: 41) was 
prepared by DNA Technology A/S (Denmark) . The primer sequence 

30 is contained within the 1497 bp fragment. It is found at the 
end closest to the Zn 2+ - finger and points in that direction. 
Two primers both of prtT Zn 2+ - finger specific sequence and 
pointing either downstream (140358) or upstream (140359) were 
also prepared by DNA Technology A/S (Denmark) . The sequence of 
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the two primers are as follows: 140358: 

CGCAAGCTTAAAACTCGGTGCGATC (SEQ ID NO: 42) ' and 140359: 
CCTCGTGGATCAAGATCGCA (SEQ ID NO: 43) . Two PCR reactions, one 
with the primers 153468 and 140358 and one with 153468 and 
5 140359, respectively, were performed with genomic DNA as tem- 
plate. The reaction with 153468 and 140359 gave a band of ap- 
proximately 1.1 kb, the other reaction gave no visible bands, 
when analysed on an agarose gel. The 1.1 kb fragment was cloned 
into pCR4Blunt-TOPO (Invitrogen) and sequenced. The fragment 
10 contained part of the Zn 2+ - finger and overlaps with the 14 97bp 
fragment. Translation of the sequence showed that the region 
immediately upstream of the Zn 2+ -finger encodes a polypeptide 
with homology to the N- terminal of prtT from A. niger. 

15 Example 7 

Isolation of the complete A. oryzae prtT gene 

The remaining parts of the gene were cloned by two con- 
secutive rounds of inverse PCR. In the first inverse PCR reac- 
tion the genomic DNA was restricted with EcoRV and re-ligated. 

20 The PCR reaction was run with the primers 175653: 
GATGAAAAGAATAATCGGCGAG (SEQ ID NO: 44) and 175654: 
CGCGGCACACTACCCCCGTTG (SEQ ID NO: 45). The reaction resulted in 
the synthesis of a 1.9kb fragment, which was cloned into the 
pCR4Blunt-TOPO vector and sequenced. Analysis of the sequence 

25 data showed that the fragment contains a gene with homology to 
the A. niger prtT gene and that the 3 "end of the gene was miss- 
ing. The second inverse PCR reaction was thus performed. The 
primers were B0403G08: ATCTAGCTCAAGCATTAGCGGC (SEQ ID NO: 46) 
and B0403G09: AATTTCGGCCCTTTAGTGTCC (SEQ ID NO: 47). Bglll re- 

30 stricted and re-ligated genomic DNA was used as template. A 2.4 
kb fragment was obtained and cloned into the pCR4Blunt-TOPO 
vector and sequenced. Analysis of the sequence showed that the 
complete A. oyzae prtT gene had been obtained. The DNA se- 
quence of the A. oryzae prtT gene is shown in SEQ ID NO: 48 
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and the deduced amino acid sequence of the encoded protein is 
shown in SEQ ID NO: 49. 

Example 8 

5 Disruption of the Aspergillus oryzae prtT gene. 

The A. oryzae prtT gene was disrupted using a method of 
positive/negative selection. A disruption cassette consisting 
of 2kb of the A. oryzae prtT gene (SEQ ID NO: 48) with an in- 
sertion of the pyrG gene in the middle is cloned into a vector 

10 (pDV8) containing the herpes simplex virus I tymidine kinase 
gene (HSV-tk) flanked by fungal expression signals. Expression 
of the tymidine kinase gene makes the host sensitive to 5- 
flouro-2-deoxyuridine. A disrupted strain can be isolated by 
positive selection for the pyrG gene in a pyrG minus host and 

15 deselection of the tymidine kinase gene on 5-flouro-2- 
deoxyuridine . Since the tymidinie kinase gene and the pyrG gene 
are present in the same DNA fragment selection is for transfor- 
mants in which a doubble cross-over event has happened. The 
system gives fewer transf ormants pr. Micro g of DNA than trans- 

20 formation with just a disruption cassette, but the frequency of 
transformants in which the desired homologous recombination 
event has occurred is much higher. 

The pyrG gene used here is flanked by repeats enabling a later 
removal by selection for 5-f luoroorotic acid resistance. The 
25 pyrG gene is isolated from the plasmid pJaL554. 

The pDV8 plasmid : 

pDV8 was kindly provided by Matthew S. Sachs, University 
of Oregon, PO Box 91000, Portland, OR 97291-1000, USA. pDV8 
30 (Fig. 7) is a pSP65 (Promega™) based plasmid containing the 
HSV-tk gene on a 1.2 kb Bglll/BamHI fragment inserted between a 
1.0 kb Xhol/Bglll fragment of the A. nidulans gpd promoter and 
a 0.8 kb BamHI/Hindlll fragment containing the A. nidulans trpC 
transcriptional terminator . 
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The A. nidulans gpd promoter and the trpC transcriptional 
terminator are taken from the plasmid pAN51-2 (Punt et al. # 
(1990), Gene 93, p. 101-109). The HSV-tk gene is described by 
McKnight S.L., (1980), Nucleic Acids Res. 8:5949-5964, Database 
5 accession no. EMBL V00470, position 252-1479. The construction 
of pDV8 is described in Vaught -Alexander D (thesis) Expression 
of the herpes simplex virus type-1 thymidine kinase gene in 
Neurospora crassa, (1994) , Oregon Graduate Institute of Sci- 
ence & Technology, University of Portland, PO Box 91000, Port- 

10 land, OR 97291-1000, USA. The sequence of pDV8 is included in 
this application as SEQ ID NO: 50. Single-, double- and multi- 
copy A. oryzae transf ormants of pDV8 were isolated by trans- 
forming a pDV8 derivative containing the A. oryzae niaD gene 
into an A. oryzae niaD mutant. The copy number of the HSV-tk 

is gene was determined by Southern analysis . ' The transf ormants and 
the untransformed host were inoculated onto plates containing 
varying concentrations of 5-f louro-2 ' -deoxyuridine . From in- 
spection of the growth on the plates it was decided to use 6 
microM of 5-f louro-2 ' -deoxyuridine in the plates for future 

20 positive/negative selection experiments. At this concentration 
none of the pDV8 transf ormants grew, while the untransformed 
host was only slightly inhibited. 

Description of pJaL554: 
25 PJaL554 was constructed by ligating the 316 bp Asp718-Nhel 
fragment to the 533 6 bp Spel-SspBI fragment from the pyrG con- 
taining plasmid pS02 (described in WO 97/35956) . Thus, pJaL554 
harbours the A., oryzae pyrG gene flanked by 316 bp repeats. 
The construction is illustrated in Fig. 8. 

30 

Construction of a prtT disruption plasmid in the pDV8 vector: 

A PCR reaction is performed on chromosomal A. oryzae 
IF04177 DNA with the primers B1042E05 and B1450E07. 
B1042E05: CGCGCGTATCCTATTGCC (SEQ ID NO: 51) 
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B1450E07: GCCGGAAATGTTGTACCTAC (SEQ ID NO: 52) . 

A fragment of 2078 basepairs is obtained and cloned into the 
pCR4Blunt-TOPO (Invitrogen™) vector. The resulting plasmid is 
sequenced with the standard M13 forward (-40) and reverse prim- 
5 ers to ensure that the correct fragment is obtained. The PCR 
fragment is excised from the vector by the restriction enzyme 
EcoRV which cuts twice internally in the fragment. The cut 
sites are located at positions 1 and 1964 in SEQ ID NO: 48, The 
1964 bp fragment is ligated with the pDV8 vector, which has 

10 been cut with Hindlll and blunt ended by filling in the ends 
with the Klenow fragment of DNA polymerase I from E. coli and 
dNTP. The resulting plasmid is cut with Hindlll, which is lo- 
cated in the prtr fragment (in the part encoding the Zn 2+ - 
finger) at position 962 in SEQ ID NO: 48, dephosphorylated and 

15 ligated with the pyrG gene isolated from pJaL554 as a 2.5 kb 
Hindlll fragment. 

Selection of prtT disrupted strains: 

The disruption plasmid described above was linearized 
with Not J and transformed into A. oryzae HowBlOl (described in 

20 WO 97/35956), a pyrG minus derivative of IF04177. The transfor- 
mation is done essential as described in EP 0 238 023. Trans- 
formants are selected on plates containing Coves salt solution 
(Cove DJ, (1966), Biochim. Biophys . Acta 113 :51-56), 1 M su- 
crose for osmotic stabilization and as carbon source, 2 0g/L 

25 agar, 10 mM NaN0 3 and 6 microM 5-f louro-2-deoxyuridine (Sigma) . 
The transformants are reisolated once on the same type of 
plates. Transformants carrying a disrupted prtr gene are iden- 
tified by Southern blot analysis. 

A strain carrying the prtr disruption is used as host for 

30 expression of a truncated PDI gene (Protein Disulfide isomerase 
gene) harbored on the expression plasmid pCaHj 44 5 (described in 
US patent 5,879,664). pCaHj445 is transformed into the A. 
oryzae prtT disrupted strain by cotransf ormation with the plas- 
mid p3SR2 containing the A. nidulans amdS gene. Transformation 
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and selection on acetamide plates is done essentially as de- 
scribed in EP 0 238 023, After reisolation the transformants 
are fermented in shake flasks or fermentors and the PDI protein 
is purified from the fermentation broth. 

5 

The invention described and claimed herein is not to be 
limited in scope by the specific embodiments herein disclosed, 
since these embodiments are intended as illustrations of 
several aspects of the invention. Any equivalent embodiments 

10 are intended to be within the scope of this invention. Indeed, 
various modifications of the invention in addition to those 
shown and described herein will become apparent to those 
skilled in the art from the foregoing description. Such 
modifications are also intended to fall within the scope of the 

is appended claims . 

Various references are cited herein, the disclosures of 
which are incorporated by reference in their entireties. 
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Claims: 

1. An isolated nucleic acid sequence encoding a polypeptide 
having transcriptional activation activity, selected from 
5 the group consisting of: 

(a) a nucleic acid sequence having at least 70% 
identity with the nucleic acid sequence of SEQ ID NO:l or 
SEQ ID NO: 48; 

(b) a nucleic acid sequence encoding a polypeptide 
10 having an amino acid sequence which has at least 50% 

identity with the amino acid sequence of SEQ ID NO: 2 or 
SEQ ID NO: 49; 

(c) a nucleic acid sequence which hybridizes under low 
stringency conditions with (i) the nucleic acid sequence 

15 of SEQ ID NO:l or SEQ ID NO: 48, or (ii) its 

complementary strand, wherein the low stringency 
conditions are defined by prehybridization and 
hybridization at 42°C in 5x SSPE, 0.3% SDS, 200 micro 
g/ml sheared and denatured salmon sperm DNA, and 25% 

20 formamide, and wash conditions are defined by 50°C for 30 

minutes in 2X SSC, 0.2% SDS; 

(d) an allelic variant of (a) , (b) , or (c) ; 

(e) a subsequence of (a) , (b) , (c) , or (d) , wherein the 
subsequence encodes a polypeptide fragment which has 

25 transcriptional activation activity; and 

(f ) a subsequence of (a) , (b) , (c) , or (d) , wherein the 
subsequence encodes a polypeptide with the amino acid 
sequence of SEQ ID NO: 3. 

30 2. The nucleic acid sequence of claim 1 which has at least 70%, 
preferably at least 80%, more preferably at least 85%, even 
more preferably at least 90%, most preferably at "least 95%, 
even most preferably at least 97%, and even more preferred 
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at least 99% identity with the nucleic acid sequence of SEQ 
ID NO:l or SEQ ID NO: 48. 

3 . The nucleic acid sequence of claim 1 which has the nucleic 
5 acid sequence of SE@ ID NO:l or SEQ ID NO: 48. 

4 . The nucleic acid sequence of any of claims 1 to 3 which 
encodes a polypeptide comprising an amino acid sequence 
which has at least 50%, preferably at least 60%, preferably 

10 at least 70%, more preferably at least 80%, even more 
preferably at least 90%, and most preferably at least 95% 
identity with the amino acid sequence of SEQ ID NO: 2 or SEQ 
ID NO: 49. 

15 5. The nucleic acid sequence of any of claims 1 to 4, wherein 
the nucleic acid sequence is obtained from a fungal cell or 
a yeast cell. 

6. The nucleic acid sequence of claim 5, wherein the fungal 
20 cell is a filamentous fungal cell. 

7. The nucleic acid sequence of claim 6, wherein the 
filamentous fungal cell is an Aspergillus, Fusarium, 
Penicillium, or Trichoderma cell. 

25 

8. The nucleic acid sequence of claim 7, wherein the 
Aspergillus cell is a strain of Aspergillus niger or 
Aspergillus oryzae, or a respective synonym or teleomorph 
thereof . 

30 

9. The nucleic acid sequence of claim 8, wherein the 
Aspergillus cell is a strain of Aspergillus niger DSM 12298 
or Aspergillus oryzae IF04177. 
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10. The nucleic acid sequence of claim 7, wherein the 
Fusarium cell is a strain of Fusarium venenatum, or a 
synonym or teleomorph thereof. 

5 11. The nucleic acid sequence of claim 5, wherein the yeast 
cell is a Hansenula, Pichia, or Saccharomyces cell. 

12 . The nucleic acid sequence of any of claims 1 to 11 which 
encodes a polypeptide comprising the amino acid sequence of 

10 SEQ ID NO: 2 or SEQ ID NO: 49, or a fragment thereof, which 
has transcriptional activation activity. 

13 . The nucleic acid sequence of any of claims 1 to 11 which 
encodes a polypeptide comprising the amino acid sequence of 

15 SEQ ID NO: 3. 

14. The nucleic acid sequence of any of claims 1 to 13, which 
hybridizes under low, preferably medium, and more preferably 
high, stringency conditions to (i) the nucleic acid sequence 

20 set forth in SEQ ID N0:1 or SEQ ID NO: 4 8 or (ii) the 
respective complementary strand, or a subsequence thereof. 

15. The nucleic acid sequence of claim 14, wherein low 
stringency conditions are defined by prehybridization and 

25 hybridization at 42°C in 5X SSPE, 0.3% SDS, 200 micro g/ml 
sheared and denatured salmon sperm DNA, and 25% formamide, 
and wash conditions are defined by 50°C for 30 minutes in 2X 
SSC, 0.2% SDS; medium stringency conditions are defined by 
prehybridization and hybridization at 42°C in 5x SSPE, 0.3% 

30 SDS, 200 microg/ml sheared and denatured salmon sperm DNA, 
and 35% formamide, and wash conditions are defined by 60°C 
for 30 minutes in 2X SSC, 0.2% SDS; and high stringency 
conditions are defined by prehybridization and hybridization 
at 42°C in 5x SSPE, 0.3% SDS, 200 microg/ml sheared and 
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denatured salmon sperm DNA, and 50% formamide, and wash 
conditions are defined by 70°C for 30 minutes in 2X SSC, 
0.2% SDS. 

5 16. The nucleic acid sequence of any of claims 1 to 15, which 
comprises the nucleic acid sequence encoding a polypeptide, 
which has transcriptional activation activity contained in 
the plasmid pEES harboured in Escherichia coli DSM 122 94 or 
DNA sequence shown in SEQ ID NO: 48 encoding polypeptide 
10 shown in SEQ ID NO: 49. 

17. A nucleic acid construct comprising the nucleic acid 
sequence of any of claims 1 to 16 operably linked to one or 
more control sequences, which direct the production of the 

15 polypeptide in a suitable expression host. 

18. An expression vector comprising the nucleic acid 
construct of claim 17, a promoter, and transcriptional and 
translational stop signals. 

20 

19. A host cell comprising the nucleic acid construct of 
claim 17 or the expression vector of claim 18. 

20. An isolated polypeptide selected from the group 
25 consisting of: 

(a) a polypeptide which is encoded in a nucleic acid 

sequence which hybridizes under low stringency conditions 
with (i) the nucleic acid sequence of SEQ ID N0:1 or SEQ 
ID NO: 48; (ii) its complementary strand, or (iii) a 
30 subsequence of SEQ ID NO:l or SEQ ID NO: 48 which encodes 

a polypeptide fragment which has transcriptional 
activation activity, wherein the low stringency 
conditions are defined by prehybridization and 
hybridization at 42 °C in 5X SSPE, 0.3% SDS, 200 micro 
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formamide, and wash conditions are defined at 50°C for 30 
minutes in 2X BSC, 0.2% SDS; 

(b) . a polypeptide having an amino acid sequence which 
5 has at least 50% identity with the amino acid sequence of 

SEQ ID NO: 2 or SEQ ID NO: 49; 

(c) an allelic variant of (a) or (b) ; 

(d) a fragment of (a) , (b) , or (c) , wherein the 
fragment has transcriptional activation activity; and 

io (e) a polypeptide comprising the amino acid sequence of 

SEQ ID NO: 3, or an allelic variant thereof. 

21. The polypeptide of claim 20, comprising an amino acid 
sequence which has at least 50%, preferably at least 60%, 

15 preferably at least 70%, more preferably at least 80%, even 
more preferably at least 90%, more preferably at least 95%, 
even more preferred 97%, and most preferred 99% identity 
with the amino acid sequence of SEQ ID NO: 2 or SEQ ID NO: 
49. 

20 

22. The polypeptide of claim 20 or 21, comprising the amino 
acid sequence of SEQ ID NO: 2 or SEQ ID NO: 49, or a fragment 
thereof, wherein the fragment retains transcriptional 
activation activity. 

25 

23. The polypeptide of any of claims 20 to 22 which is 
encoded in the nucleic acid sequence contained in plasmid 
pEES which is contained in Escherichia coli DSM 122 94 or the 
DNA sequence shown in SEQ ID NO: 48. 

30 

24. The polypeptide of any of claims 20 to 23 which comprises 
the amino acid sequence of SEQ ID NO: 3. 
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25. A method for producing the polypeptide of any of claims 20 
to 24 comprising (a) cultivating the host cell of claim 19 
under conditions conducive for production of the 
polypeptide; and (b) recovering the polypeptide. 

5 

26. A fungal host cell useful for the production of a 
polypeptide, wherein the cell: 

a) is a mutant of a parent fungal cell in which the parent 
cell comprises one or more DNA sequences encoding a 

10 protease, the transcription of which is activated by a 

transcriptional activator encoded by a nucleic acid 
sequence of any of claims 1 to 16; and 

b) produces less of the transcriptional activator and the 
protease (s) than the parent cell when cultured under the 

15 same conditions . 

27. The host cell of claim 26, wherein the reduced production 
of the transcriptional activator is obtained by modification 
or inactivation of a nucleic acid sequence present in the 

20 cell and necessary for expression of the transcriptional 
activator. 

28. The host cell of claim 26 or 27, wherein the reduced 
production of the transcriptional activator is obtained by 

25 modification or inactivation of a control sequence required 
for the expression of the polypeptide. 

29. The host cell of claim 28, wherein the control sequence is a 
promoter sequence, or a functional part thereof. 

30 

30. The host cell of any of claims 26 to 29, wherein the 
nucleic acid sequence to be modified or inactivated is the 
sequence defined in any of claims 1 to 16. 
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31. The host cell of any of claims 26 to 30, wherein the 
modification or inactivation is performed by specific or 
random mutagenesis, site-directed mutagensis, PCR generated 
mutagenesis, nucleotide insertion and/or substitution, gene 

5 interruption or gene replacement techniques, anti -sense 
techniques, or a combination thereof. 

32. A fungal host cell useful for the production of a 
polypeptide, wherein the host cell is a mutant of a parent 

10 cell, in which the mutant: 

a) produces more of a transcriptional activator encoded by a 
nucleic acid sequence of any of claims 1 to 16 than the 
parent cell when cultured under the same conditions, and 

b) comprises a DNA sequence encoding the polypeptide, the 
15 transcription of which • is activated by the 

transcriptional activator. 

33. The host cell of claim 32, wherein the host cell 
produces more of the transcriptional activator than the 

20 parent cell by introducing into the parent cell one or more 
copies of: (i) a nucleic acid sequence of any of claims 1 to 
16, (ii) the nucleic acid construct of claim 17, or (iii) 
the expression vector of claim 18, whereby the host cell 
produces more of the polypeptide than the parent cell when 

25 cultured under the same conditions. 

34. The host cell of claim 32 or 33, wherein the nucleic acid 
acid sequence encoding the transcriptional activator is 
operably linked to a promoter, which is stronger than the 

30 corresponding promoter of the parent cell. 

35. The host cell of claim 34, wherein the promoter mediates 
the expression of a gene encoding an extracellular protease, 
preferably Aspergillus oryzae alkaline protease, A. oryzae 
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neutral metalloprotease, A. niger aspergillopepsin protease, 
Fusarium oxysporum trypsin-like protease or F. venenatum 
trypsin. 

36. A fungal host cell useful for the production of a 
polypeptide, wherein the cell is a mutant of a parent cell 
in which the mutant comprises: 

a) a modification or inactivation of a transcriptional 
activator which is encoded in a native nucleic acid 
sequence of any of claims 1 to 16, or a regulatory 
sequence thereof , and 

b) (i) an inducible promoter operably linked to a nucleic 
acid sequence of any of claims 1 to 16, and (ii) a 
promoter sequence to which a transcriptional activator 
encoded by the nucleic acid sequence of any of claims 1 
to 16 can bind, operably linked to a nucleic acid 
sequence encoding the polypeptide, wherein (i) and (ii) 
can be introduced simultaneously or sequentially. 

37. The host cell of claim 36 wherein the native nucleic acid 
sequence, or a regulatory sequence thereof, is modified or 
inactivated by specific or random mutagenesis, site-directed 
mutagensis, PCR generated mutagenesis, nucleotide insertion 
and/or substitution, gene interruption or gene replacement 
techniques, anti- sense techniques, or a combination thereof. 

38. The host cell of claim 36 or 37, wherein the inducible 
promoter is selected from the group in which the induction is 
mediated by a carbon or nitrogen catabolite. 

39. The host cell of any of claims 33 to 38, which further 
comprises a promoter sequence, wherein the promoter sequence 
can be activated by the transcriptional activator and is 
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operably linked to the nucleic acid sequence encoding the 
polypeptide . 

40. The host cell of any of claims 33 to 39, wherein the 
5 promoter sequence, or a functional part thereof, is from a 

protease gene. 

41. The host cell of any of claims 33 to 40, wherein the 
protease gene is Fusarium oxysporum trypsin-like protease 

10 gene, Aspegillus oryzae alkaline protease gene, Aspergillus 
niger pacA gene, Aspergillus oryzae alkaline protease gene, 
A. oryzae neutral metalloprotease gene, A. niger 
aspergillopepsin protease gene, or F. venenatum trypsin 
gene. 

15 

42. The host cell of any of claims 26 to 41, wherein the host 
cell comprises at least one copy of a nucleic acid sequence 
encoding the polypeptide. 

20 43. The host cell of any of claims 26 to 42, wherein the host 
cell produces less of a native protease or a combination of 
native proteases than the parent cell when cultured under 
identical conditions. 

25 44. The host cell of any of claims 2 6 to 43, wherein the 
activity of the protease is assayed by the degradation of 3 H- 
labelled sperm whale myoglobin at pH 4. 

45. A method of producing a polypeptide, comprising: 

30 (a) cultivating the host cell of any of claims 26 to 44, 
wherein the host cell harbours a gene encoding the 
polypeptide, in a nutrient medium suitable for production 
of the polypeptide; and 
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(b) recovering the polypeptide from the . nutrient medium of 
the mutant cell. 

46. The method of claim 45, wherein the polypeptide is native 
5 to the parent cell. 

47. The method of claim 45, wherein the polypeptide is 
heterologous to the parent cell. 

10 48. The method of claim 45, wherein the polypeptide is an 
antibody or portions thereof, an antigen, a clotting factor, . 
an enzyme, a hormone or. a hormone variant, a receptor or 
portions thereof, a regulatory protein, a structural 
protein, a reporter, or a transport protein. 

15 

49. The method of claim 48, wherein the enzyme is an ' 
oxidoreductase, transferase, hydrolase, lyase, isomerase, or 
ligase. 

20 50. The method of claim 49, wherein the enzyme is an 
aminopeptidase , amylase , carbohydrase , carboxypept idase , 
catalase, cellulase, chitinase, cutinase, deoxyribonuclease, 
dextranase, esterase, alpha-galactosidase, beta- 

galactosidase , glucoamylase , alpha-glucosidase , beta- 

25 glucosidase, haloperoxidase, invertase, laccase, lipase, 
mannosidase, mutanase, oxidase, pectinolytic enzyme, 
peroxidase, phytase, polyphenoloxidase, proteolytic enzyme, 
ribonuclease, transglutaminase, or xylanase. 
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ICA217 ATGACCGCTT GTCATACCTG CCGCAAGCTT AAAACTCGGT 
ICA218 ATGACTGCTT GCCACACCTG CCGCAAGCTT AAAACTCGGT 



ICA217 GCGATCTTGA TCCACGAGGG CATGCCTGCC GCCGCTGCCT (SEQ ID NO: 53) 
ICA218 GCGATCTTGA TCCACGAGGG CATGCCTGCC GCCGCTGCCT (SEQ ID NO: 54) 
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SEQUENCE LISTING 

<110> Novo Nordisk A/S 
<120> 

5 

<130> 

<160> 54 

10 <170> Patentln Ver. 2.1 

<210> 1 
<211> 4098 
<212> DNA 
15 <213> E. coli DSM 12294 

<220> 

<221> exon 

<222> (977) . . (1204) 

20 

<220> 

<221> exon 

<222> (1317) . . (1718) 

25 <220> 

<221> exon 

<222> (1777) . . (2202) 

<220> 
30 <221> exon 

<222> (2253) . . (3116) 

<220> 

<221> exon 
35 <222> (3170) . . (3247) 

<400> 1 

ttggtgctgg aaagcccatt taagggatct tataaggtaa ttgccaatgt tcagtcgcct 60 

40 atggtctttg tcgagagaaa ctctttctcg ttaagatcta catgatcgct tttgattttc 120 

tctgggttca cgcggtactt tctccccgtc aatccccaac cgctgttgtg cctgaccatc 180 

aatgtggaac ggataagggg acaagagaaa ttgaaggagc gatcataaaa agctaatttt 240 

45 

ggtttattat ttttttttct tataaaactc aaaaaagaaa acgaaaacga aaaaggaaaa 300 

aagaaaaggt aaaatggaaa aagaaaggcg gtcatcactt ccaataacca tcagccaaag 360 

50 atacagacga gttactgacc ttcttatcct ggacttccgc ccgatccata tcttcatgat 420 

aagcagggaa ccgaacaaat caacgccaac ttcagcggca gttcctcact aatttcccac 480 

ttcccaccgg cgtcattttg gtcccaaccc cctccctgga agcagcggga tttagttacg 540 

55 

atccggttta catcggagac tcggaaaata ccatagcgca tgccaatcaa aacccctccc 600 

agggtgactg gccagtatca cgacccattg tttctatctt tctagaagac ctgcagggac 660 

60 atggattggc tggccgccgt gctgccgtcc attagcgtct accccaggtc aagaacggac 720 
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tggacggacc cataaccaat ctaaccaaag ccaatttcgt caattcccag ctggcgagca 780 
caatcccatt cccagggttg gccgccaact gttaaaaggc actatgtgtc tctccacctg 840 
cccgcccccc tcgatggcct gcgcgtaata actattctac tgctttttgc ctcttacttg 900 
cctcattatt agtattttac tctactctcc agattgcctg ccagcaattg gtccaaagtg 960 
10 gactttgttt gatgac atg act cga acc gtg gac gag ate aaa tac gaa acg 1012 
cct tct tea tgg gag cac aag age ttg gac gtt gee gag gat ggc agg 1060 
cga eta get ccc cat tec gac act get cgt ccg aaa ggc cgc ata cga 1108 

15 

cga teg atg act gec tgt cac aca tgt egg aag ctt aaa act aga tgt 1156 
gat eta gat ccg cgc ggt cat gcg tgc cgt cgc tgt eta tct eta agg 1204 
20 tcagaggcac tacctacctg ccagttgaag ctttgtcctt ctgaacgega catgatacta 1264 
gtcgtggaat ataactgtcc caactttget gacagtccac aatatcttta ga ate gat 1322 
tgt aag ctg cct gaa acg acc gac cgc ttc caa gac agt get gcg atg 1370 

25 

tgg cca gac gee acc teg gca att ccc tec ate gag gag cgc etc acc 1418 
tec eta gaa aga tgc atg agg gag atg acg ggc atg atg cga cag atg 1466 
30 eta gat cac tec cca ggt ttc gca aat gee teg gtt ccg cat ttg acc 1514 
aaa age ate ate acg gat gaa acc gee teg atg gag gga age ccg teg 1562 
tec ccc ttc ctg cct aag ccc gtt cgc etc att cag gac etc cag tec 1610 

35 

gac ttc ttc gga gaa gca gag act tec ccc gtt gac tec cct etc tec 1658 
age gat ggt aac gee aag ggc get ate gac tct aag eta tec etc aaa 1706 
40 ttg ttg caa acg tatgggtata cctgattgac aattaccaaa aagctgctaa 1758 
tccttggcgc aaatcagg ttt gtc gat cac ttt ggc get tgc gtt tec att 1809 
tac aat etc tec gac ate cac aac gac atg aaa gee ccc gac tct tta 1857 

45 

ctg tat aat act gca tgc ctt eta get tea cgc tat gta ccg ggg ata 1905 
ccg aca tct acc gtg cat get ata tac ctt caa gtg cga cat gca gta 1953 
50 gtc aat att ttg tgg gaa aaa cca ccc ctg aag tat gag acc etc caa 2001 
gca ctt gca ctt etc tgt etc tgg cca gca acc gee cag aaa gag cca 2049 
ccc atg gac age tgg ctg ctg agt ggt ate tea att aac cat gca att 2097 

55 

ate gcg etc gat ttc eta aac tat gcg ccc teg gaa gtc atg gtg gac 2145 
aat gaa acg get gcg cag ctg egg eta tgg aat aca tat tgc ttg aca 2193 
60 cag eta cag tgggtttcat ctaagatctc ccgtccagaa gatagctaac 2242 
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aagctttagt ttt gcg gtc ggg aat gcg cgt cct ttc cat ate cag caa 2291 
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tgc 


cag 


aca 


2579 


15 


ctc 


ctt 


cac 


CC1C 

*-y 




y^Q 


ct L.y 


agg 




cag 


ccc 


aga 


tec 


gac 


agg 


ccc 


2627 




gca 




f"Ta<*T 

y^y 


y l l 






acc 


tua 


cgt 


ctg 


ata 


ata 


4- J-IJM* 

teg 


egg 


i_ i_ _ 
ttc 


etc 


2675 


20 


nan 




c gg 


+p a r» 


i-r*h 
UCL 


ace 


gca 




age 


ccc 


gtc 


gac 


caa 


gtc 


tat 


ttc 


2723 


att 


gtc 


ggc 


tac 


get 


gca 


ctg 


aat 


ctg 


tgc 


gat 


ttc 


aat 


ctt 


atg 


gac 


2771 




ccg 


ctt 


ate 


gag 


caa 


gtg 


cag 


atg 


ttc 


ctg 


ctg 


cat 


etc 


tec 


ccg 


aac 


2819 


25 


gaa 


gac 


cac 


ate 


gee 


tac 


egg 


ttt 


teg 


tgc 


atg 


gtc 


gee 


gag 


ttc 


aag 


2867 




egg 


cga 


tgt 


ggc 


agt 


gcg 


gaa 


tgc 


aat 


gac 


cca 


tea 


tec 


act 


gtc 


aag 


2915 


30 


ggg 


tct 


ccg 


tta 


tea 


tec 


tac 


ggc 


gac 


agt 


cgt 


aag 


atg 


age 


atg 


ggg 


2963 


caa 


gca 


ccg 


ttc 


atg 


cca 


ccg 


etc 


atg 


gat 


ggc 


atg 


ate 


gag 


ggg 


tac 


3011 




ggc 


ttc 


gag 


caa 


ctg 


atg 


cca 


gaa 


gtc 


atg 


ccg 


agt 


tec 


ttt 


ccg 


gat 


3059 


35 


ggg 


ata 


etc 


aac 


gga 


atg 


cct 


gtg 


act 


ggg 


eta 


gca 


gcg 


tat 


egg 


tea 


3107 



gcg acg ctg taagtaatcg agategggtt ggaaaggaca tgagtggggg 3156 
tggtggtggt agt age agt aac acc agg gat gat aac ctg cag egg tgg 3205 

40 

ttt agt tec tgc cca tgg get gaa eta aaa ccc cga acc tag 3247 
catgatgacg tgcaacgaaa ggatcataac caaggecaag taaatactaa aataaaataa 3307 
45 tataattcca cacgatccac taccaccacc accaccggat ccatcaggtt gccttcctgc 3367 
acaggectat ttagttagag ggcccgtgcc acgaaacatc aegtaattga gcgcttttgc 3427 
ttccttgcaa cttaaacaac cccatagaca ctctcacatt cacatgccaa actactaact 3487 

50 

cctactgacc accagctgca ggaagecage cagccaccat ttcctaatcg gatatatctc 3547 
egaaaegtae gctttcctcc tttgttcgga ccgttccgtg cctccgcgga gagttgaacg 3607 
55 agtcagaaca cattcttttc gtttctatcg tttcttttcc aaggcagcag agagacgaac 3667 
aagtcagtgc ttgetaacta acttacccct cagcatttta gtaaactact atttaggaaa 3727 
gagtaatcat tcatcgaaga caagatgttt atttctccga tcgaccaaac aaaaacgttc 3787 

60 

aggtagacta agtagtagta gtagtatgtc tttgacccct ttactccact ateegttgae 3847 
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tgcacatagt agtaagtaac tatctaacca gttgccgagg agaggaaagt gagtgggtgg 3907 
gagccggagg atgccgccga gaattattaa gtcgatcatt gctagttagt tatcttttca 3967 
tgatgaggag aggaaggaga ggggggacgg gattagagaa ataaactttt ctctccaatt 4027 
aattatctgg attaattaaa acttggagag gagggtaggg gagttgggta ttggtatgtt 4087 
gctgtgaatg t 4098 



<210> 2 
<211> 666 
<212> PRT 

<213> E.coli DSM 12294 
<220> 

<223> Description of Artificial Sequence: MBL1213 
<400> 2 

Met Thr Arg Thr Val Asp Glu He Lys Tyr Glu Thr Pro Ser Ser Trp 
1 5 10 15 

Glu His Lys Ser Leu Asp Val Ala Glu Asp Gly Arg Arg Leu Ala Pro 
20 25 30 

His Ser Asp Thr Ala Arg Pro Lys Gly Arg He Arg Arg Ser Met Thr 
35 40 " 45 

Ala Cys His Thr Cys Arg Lys Leu Lys Thr Arg Cys Asp Leu Asp Pro 
50 55 60 

Arg Gly His Ala Cys Arg Arg Cys Leu Ser Leu Arg He Asp Cys Lys 
65 70 75 80 

Leu Pro Glu Thr Thr Asp Arg Phe Gin Asp Ser Ala Ala Met Trp Pro 
85 90 95 

Asp Ala Thr Ser Ala He Pro Ser He Glu Glu Arg Leu Thr Ser Leu 
100 105 110 

Glu Arg Cys Met Arg Glu Met Thr Gly Met Met Arg Gin Met Leu Asp 
115 120 125 

His Ser Pro Gly Phe Ala Asn Ala Ser Val Pro His Leu Thr Lys Ser 
130 135 140 

He He Thr Asp Glu Thr Ala Ser Met Glu Gly Ser Pro Ser Ser Pro 
145 150 155 160 

Phe Leu Pro Lys Pro Val Arg Leu He Gin Asp Leu Gin Ser Asp Phe 
165 170 175 

Phe Gly Glu Ala Glu Thr Ser Pro Val Asp Ser Pro Leu Ser Ser Asp 
180 185 190 

Gly Asn Ala Lys Gly Ala lie Asp Ser Lys Leu Ser Leu Lys Leu Leu 
195 200 205 
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Gin Thr Phe Val Asp His Phe Gly Ala Cys Val Ser He Tyr Asn Leu 
210 215 220 

Ser Asp He His Asn Asp Met Lys Ala Pro Asp Ser Leu Leu Tyr Asn 
225 230 235 240 

Thr Ala Cys Leu Leu Ala Ser Arg Tyr Val Pro Gly He Pro Thr Ser 
245 250 255 

Thr Val His Ala He Tyr Leu Gin Val Arg His Ala Val Val Asn He 
260 265 270 

Leu Trp Glu Lys Pro Pro Leu Lys Tyr Glu Thr Leu Gin Ala Leu Ala 
275 280 285 

Leu Leu Cys Leu Trp Pro Ala Thr Ala Gin Lys Glu Pro Pro Met Asp 
290 295 300 

Ser Trp Leu Leu Ser Gly He Ser He Asn His Ala He He Ala Leu 
305 310 315 320 

Asp Phe Leu Asn Tyr Ala Pro Ser Glu Val Met Val Asp Asn Glu Thr 
325 330 "* 335 

Ala Ala Gin Leu Arg Leu Trp Asn Thr Tyr Cys Leu Thr Gin Leu His 
340 345 ~ 350 

Phe Ala Val Gly Asn Ala Arg Pro Phe His He Gin Gin Arg Tyr Leu 
355 360 365 

Asp His Cys Pro Arg He Leu Glu His Pro Ala Ala Thr Leu Glu Asp 
370 375 380 

Ala Arg Val Val Ala Glu He Gin Leu Tyr Leu Met Thr Leu Arg Leu 
385 390 * 395 ■ 400 

Gin Ser Asn Ser Ser Arg Met Arg Leu Ala Asp Leu Asp Tyr Glu Glu 
405 410 * 415 

He Glu Arg Trp Lys Arg Glu Trp Ala His Leu Phe Cys Lys Lys Pro 
420 425 430 

Val Leu Val Ser Arg Gly Leu Pro Leu Thr Arg Ala Thr Ala Gly Glu 
435 440 ^ 445 

Ser Ser Thr Leu Glu Leu Ser Leu Trp Phe Cys Gin Thr Leu Leu His 
450 455 "* 460 

Arg Thr Ala Met Arg Leu Gin Pro Arg Ser Asp Arg Leu Ala Ser Glu 
465 470 475 480 

Val Leu Gin Thr Ser Arg Leu He He Ser Arg Phe Leu Gin He Arg 
485 490 495 

Tyr Ser Thr Ala Leu Ser Leu Val Asp Gin Val Tyr Phe He Val Gly 
500 505 "* 510 

Tyr Ala Ala Leu Asn Leu Cys Asp Phe Asn Leu Met Asp Pro Leu He 
515 520 525 



Glu Gin Val Gin Met Phe Leu Leu His Leu Ser Pro Asn Glu Asp His 
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530 



535 



540 



He Ala Tyr Arg Phe Ser Cys Met Val Ala Glu Phe Lys Arg Arg Cys 
545 550 555 560 

Gly Ser Ala Glu Cys Asn Asp Pro Ser Ser Thr Val Lys Gly Ser Pro 
565 570 575 

Leu Ser Ser Tyr Gly Asp Ser Arg Lys Met Ser Met Gly Gin Ala Pro 
580 585 590 

Phe Met Pro Pro Leu Met Asp Gly Met He Glu Gly Tyr Gly Phe Glu 
595 600 605 

Gin Leu Met Pro Glu Val Met Pro Ser Ser Phe Pro Asp Gly lie Leu 
610 615 620 

Asn Gly Met Pro Val Thr Gly Leu Ala Ala Tyr Arg Ser Ala Thr Leu 
625 630 635 ~ 640 

Ser Ser Asn Thr Arg Asp Asp Asn Leu Gin Arg Trp Phe Ser Ser Cys 
645 650 655 

Pro Trp Ala Glu Leu Lys Pro Arg Thr Pro 
660 665 



<210> 3 
<211> 35 
<212> PRT 

<213> Aspergillus niger 
<220> 

<223> Description of Artificial Sequence: PepApr 
<400> 3 

Met Thr Ala Cys His Thr Cys Arg Lys Leu Lys Thr Arg Cys Asp Leu 
15 10 ^ ~ 15 

Asp Pro Arg Gly His Ala Cys Arg Arg Cys Leu Ser Leu Arg lie Asp 
20 25 30 

Cys Lys Leu 
35 



<210> 4 
<211> 2542 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR Fragment 
prtl3 allele 

<400> 4 

gacatggatt ggctggccgc cgtgctgccg tccattagcg tctaccccag gtcaagaacg 60 

gactggacgg acccataacc aatctaacca aagccaattt cgtcaattcc cagctggcga 120 

gcacaatccc attcccaggg ttggccgcca actgttaaaa ggcactatgt gtctctccac 180 
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ctgcccgccc ccctcgatgg cctgcgcgta 
ttgcctcatt attagtattt tactctactc 
gtggactttg tttgatgaca tgactcgaac 
ttcatgggag cacaagagct tggacgttgc 
5 cgacactgct cgtccgaaag gccgcatacg 
gaagcttaaa actagatgtg atctagatcc 
tctaaggtca gaggcactac ctacctgcca 
gatactagtc gtggaatata actgtcccaa 
tcgattgtaa gctgcctgaa acgaccgacc 

10 acgccacctc ggcaattccc tccatcgagg 
gggagatgac gggcatgatg cgacagatgc 
cggttccgca tttgaccaaa agcatcatca 
cgtcgtcccc cttcctgcct aagcccgttc 
tcggagaagc agagacttcc cccgttgact 

15 gcgctatcga ctctaagcta tccctcaaat 
caattaccaa aaagctgcta atccttggcg 
tgcgtttcca tttacaatct ctccgacatc 
ctgtataata ctgcatgcct tctagcttca 
gtgcatgcta tataccttca agtgcgacat 

20 cccctgaagt atgagaccct ccaagcactt 
cagaaagagc cacccatgga cagctggctg 
atcgcgctcg atttcctaaa ctatgcgccc 
gcgcagctgc ggctatggaa tacatattgc 
atctcccgtc cagaagatag ctaacaagct 

25 tccatatcca gcaaagatac cttgaccact 
ctctggagga cgcaagggtt gtagcagaaa 
agagcaatag cagtcgaatg cggttggcgg 
agagggagtg ggctcacctt ttctgtaaga 
tgacgagagc aacagctggg gaaagttcca 

30 cactccttca ccgcacagca atgaggcttc 
ttctgcaaac ctcacgtctg ataatatcgc 
taagccttgt cgaccaagtc tatttcattg 
tcaatcttat ggacccgctt atcgagcaag 
acgaagacca catcgcctac cggttttcgt 

35 gcagtgcgga atgcaatgac ccatcatcca 
gcgacagtcg taagatgagc atggggcaag 
tgatcgaggg gtacggcttc gagcaactga 
atgggatact caacggaatg cctgtgactg 
aagtaatcga gatcgggttg gaaaggacat 

40 caccagggat gataacctgc ag 



ataactattc tactgctttt tgcctcttac 240 
tccagattgc ctgccagcaa ttggtccaaa 300 
cgtggacgag atcaaatacg aaacgccttc 360 
cgaggatggc aggcgactag ctccccattc 420 
acgatcgatg actgcctgtc acacatgtcg 480 
gcgcggtcat gcgtgccgtc gctgtctatc 540 
gttgaagctt tgtccttctg aacgcgacat 600 
ctttgctgac agtccacaat atctttagaa 660 
gcttccaaga cagtgctgcg atgtggccag 720 
agcgcctcac ctccctagaa agatgcatga 780 
tagatcactc cccaggtttc gcaaatgcct 840 
cggatgaaac cgcctcgatg gagggaagcc 900 
gcctcattca ggacctccag tccgacttct 960 
cccctctctc cagcgatggt aacgccaagg 1020 
tgttgcaaac gtatgggtat acctgattga 1080 
caaatcaggt ttgtcgatca ctttggcgct 1140 
cacaacgaca tgaaagcccc cgactcttta 1*200 
cgctatgtac cggggatacc gacatctacc 1260 
gcagtagtca atattttgtg ggaaaaacca 1320 
gcacttctct gtctctggcc agcaaccgcc 1380 
ctgagtggta tctcaattaa ccatgcaatt 1440 
tcggaagtca tggtggacaa tgaaacggct 1500 
ttgacacagc tacagtgggt ttcatctaag 1560 
ttagttttgc ggtcgggaat gcgcgtcctt 1620 
gcccacggat actggagcac ccagcagcaa 1680 
tacagttgt'a tttgatgaca ttgcggctcc 1740 
accttgacta tgaggaaata gagcgatgga 1800 
agcctgttct tgtttcccgg ggactaccac 1860 
cattggagct gagcctttgg ttctgccaga 1920 
agcccagatc cgacaggctc gcatctgagg 1980 
ggttcctcca gatccggtac tctaccgcat 2040 
tcggctacgc tgcactgaat ctgtgcgatt 2100 
tgcagatgtt cctgctgcat ctctccccga 2160 
gcatggtcgc cgagttcaag cggcgatgtg 2220 
ctgtcaaggg gtctccgtta tcatcctacg 2280 
caccgttcat gccaccgctc atggatggca 2340 
tgccagaagt catgccgagt tcctttccgg 2400 
ggctagcagc gtatcggtca gcgacgctgt 2460 
gagtgggggt ggtggtggta gtagcagtaa 2520 

2542 



<210> 5 
<211> 665 
45 <212> PRT 

<213> Aspergillus niger 

<400> 5 

Met Thr Arg Thr Val Asp Glu He Lys Tyr Glu Thr Pro Ser Ser Trp 
50 1 5 10 15 

Glu His Lys Ser Leu Asp Val Ala Glu Asp Gly Arg Arg Leu Ala Pro 
20 25 "* 30 

55 His Ser Asp Thr Ala Arg Pro Lys Gly Arg He Arg Arg Ser Met Thr 
35 40 45 



60 



Ala Cys His Thr Cys Arg Lys Leu Lys Thr Arg Cys Asp Leu Asp Pro 
50 55 60 

Arg Gly His Ala Cys Arg Arg Cys Leu Ser Leu Arg He Asp Cys Lys 



WO 01/68864 



PCTYDK01/00169 



8 



65 



70 



75 



80 



Leu Pro Glu Thr Thr Asp Arg Phe Gin Asp Ser Ala Ala Met Trp Pro 
85 90 95 

Asp Ala Thr Ser Ala He Pro Ser He Glu Glu Arg Leu Thr Ser Pro 
100 105 110 

Glu Arg Cys Met Arg Glu Met Thr Gly Met Met Arg Gin Met Leu Asp 
115 120 125 

His Ser Pro Gly Phe Ala Asn Ala Ser Val Pro His Leu Thr Lys Ser 
130 135 140 

He He Thr Asp Glu Thr Ala Ser Met Glu Gly Ser Pro Ser Ser Pro 
145 150 155 160 

Phe Leu Pro Lys Pro Val Arg Leu He Gin Asp Leu Gin Ser Asp Phe 
165 170 ^ 175 

Phe Gly Glu Ala Glu Thr Ser Pro Val Asp Ser Pro Leu Ser Ser Asp 
180 185 190 

Gly Asn Ala Lys Gly Ala He Asp Ser Lys Leu Ser Leu Lys Leu Leu 
195 200 205 

Gin Thr Phe Val Asp His Phe Gly Ala Cys Val Ser He Tyr Asn Leu 
210 215 220 

Ser Asp He His Asn Asp Met Lys Ala Pro Asp Ser Leu Leu Tyr Asn 
225 230 235 240 

Thr Ala Cys Leu Leu Ala Ser Arg Tyr Val Pro Gly He Pro Thr Ser 
245 " 250 255 

Thr Val His Ala He Tyr Leu Gin Val Arg His Ala Val Val Asn He 
260 265 270 

Leu Trp Glu Lys Pro Pro Leu Lys Tyr Glu Thr Leu Gin Ala Leu Ala 
275 280 285 

Leu Leu Cys Leu Trp Pro Ala Thr Ala Gin Lys Glu Pro Pro Met Asp 
290 295 300 

Ser Trp Leu Leu Ser Gly He Ser He Asn His Ala lie He Ala Leu 
305 310 315 320 

Asp Phe Leu Asn Tyr Ala Pro Ser Glu Val Met Val Asp Asn Glu Thr 
325 330 335 

Ala Ala Gin Leu Arg Leu Trp Asn Thr Tyr Cys Leu Thr Gin Leu His 
340 345 350 

Phe Ala Val Gly Asn Ala Arg Pro Phe His He Gin Gin Arg Tyr Leu 
355 360 365 

Asp His Cys Pro Arg He Leu Glu His Pro Ala Ala Thr Leu Glu Asp 
370 375 .380 



Ala Arg Val Val Ala Glu He Gin Leu Tyr Leu Met Thr Leu Arg Leu 
385 390 395 400 
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Gin Ser Asn 

lie Glu Arg 

Val Leu Val 
435 

Ser Ser Thr 
450 

Arg Thr Ala 
465 

Val Leu Gin 
Tyr Ser Thr 



Tyr Ala Ala 
515 

Glu Gin Val 
530 

lie Ala Tyr 
545 



Ser Ser Arg Met Arg Leu Ala Asp 
405 410 

Trp Lys Arg Glu Trp Ala. His Leu 
420 425 

Ser Arg Gly Leu Pro Leu Thr Arg 
440 

Leu Glu Leu Ser Leu Trp Phe Cys 
455 

Met Arg Leu Gin Pro Arg Ser Asp 

470 475 

Thr Ser Arg Leu He He Ser Arg 
485 490 

Ala Leu Ser Leu Val Asp Gin Val 
500 505 

Leu Asn Leu Cys Asp Phe Asn Leu 
520 

Gin Met Phe Leu Leu His Leu Ser 
535 

Arg Phe Ser Cys Met Val Ala Glu 
550 555 



Gly Ser Ala Glu Cys Asn Asp Pro Ser Ser Thr 
565 * 570 

Leu Ser Ser Tyr Gly Asp Ser Arg Lys Met Ser 
580 585 

Phe Met Pro Pro Leu Met Asp Gly Met He Glu 
595 600 

Gin Leu Met Pro Glu Val Met Pro Ser Ser Phe 
610 615 

Asn Gly Met Pro Val Thr Gly Leu Ala Ala Tyr 
625 630 635 

Ser Ser Asn Thr Arg Asp Asp Asn Leu Gin Arg 
645 650 

Pro Trp Ala Glu Leu Lys Pro Arg Thr 
660 ~ 665 



Leu Asp Tyr Glu Glu 
415 

Phe Cys Lys Lys Pro 
430 

Ala Thr Ala Gly Glu 
445 

Gin Thr Leu Leu His 
460 

Arg Leu Ala Ser Glu 
480 

Phe Leu Gin He Arg 
495 

Tyr Phe He Val Gly 
510 

Met Asp Pro Leu He 
525 

Pro Asn Glu Asp His 
540 

Phe Lys Arg Arg Cys 
560 

Val Lys Gly Ser Pro 
575 

Met Gly Gin Ala Pro 
590 

Gly Tyr Gly Phe Glu 
605 

Pro Asp Gly He Leu 
620 

Arg Ser Ala Thr Leu 
640 

Trp Phe Ser Ser Cys 
655 



<210> 6 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PepApr 
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<400> 6 

cggaattcgc atgctggagg tgcttctaa 29 

5 

<210> 7 
<211> 33 
<212> DNA 

<213> Artificial Sequence 

10 

<220> 

<223> Description of Artificial Sequence: PepA/amdS 
<400> 7 

15 ttcccaggat tgaggcattt tgaccacgag att 33 



<210> 8 
<211> 17 
20 <212> DNA 

<213> Artificial Sequence 



25 



<220> 

<223> Description of Artificial Sequence: MBL1213 
<400> 8 

taacttccac cgaggtc 17 



30 <210> 9 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
35 <220> 

<223> Description of Artificial Sequence: Primer 122958 
<400> 9 

cgatcgatga ctgcctgt 18 

40 



<210> 10 
<211> 18 
<212> DNA 
45 <213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 122956 
50 <400> 10 

agagacacat agtgcctt 18 



<210> 11 
55 <211> 18 
<212> DNA 

<213> Artificial Sequence 



<220> 

60 <223> Description of Artificial Sequence: Primer 122959 
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<400> 11 

gcttatagtc gatagcgc 



5 <210> 12 
<211> 18 
<212> DNA ' 

<213> Artificial Sequence 

10 <220> 

<223> Description of Artificial 

<400> 12 

cctctctcca gcgatggt 

15 

<210> 13 
<211> 18 
<212> DNA 
20 <213> Artificial Sequence 

<220> 

<223> Description of Artificial 

25 <400> 13 

atggaataca tactgctt 



<210> 14 
30 <211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

35 <223> Description of Artificial 
<400> 14 

atgaaaccca ctgtagct 

40 

<210> 15 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

45 

<220> 

<223> Description of Artificial 

<400> 15 
50 tgctcgataa gcgggtcc 



<210> 16 
<211> 18 
55 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial 

60 

<400> 16 



11 

18 

Sequence: Primer 122960 

18 

Sequence: Primerl22962 

18 

Sequence : Primerl22961 

18 

Sequence: Primer 122963 

18 

Sequence: Primer 122964 
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attcttatgg acccgctt 



<210> 17 
5 <211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

10 <223> Description of Artificial 
<400> 17 

ccccgggaaa caagaacagg 

15 

<210> 18 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

20 

<220> 

<223> Description of Artificial 

<400> 18 
25 gttggcggac cttgactatg 



<210> 19 
<211> 22 
30 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial 

35 

<400> 19 

acagctacag tgggtttcat ct 



40 <210> 20 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

45 <220> 

<223> Description of Artificial 

<400> 20 

agtcaacggg ggaagtctc 

50 

<210> 21 
<211> 20 
<212> DNA 
55 <213> Artificial Sequence 

<220> 

<223> Description of Artificial 

60 <400> 21 

ctagcagcgt atcggtcagc 



12 

18 

Sequence: Primer 124289 

20 

Sequence: Primer 124290 

20 

Sequence: Primer 125112 

22 

Sequence : Primer 125111 

19 

Sequence: Primer 128330 

20 
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<210> 22 
<211> 19 
5 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Primer 130887 

0 

<400> 22 

cttggaaaag aaacgatag 19 



15 <210> 23 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
20 <220> 

<223> Description of Artificial Sequence: Primer 130888 
<400> 23 

aacgtacgct ttcctcctt 19 

25 

<210> 24 
<211> 21 
<212> DNA 
30 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Primer 134135 
35 <400> 24 

gggtccgtcc agtccgttct t 21 



<210> 25 
40 <211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

45 <223> Description of Artificial Sequence: Primer -48 
reverse 

<400> 25 

agcggataac aatttcacac agga 24 

50 

<210> 26 
<211> 17 
<212> DNA 
55 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Primer-40 
Universal 

60 

<400> 26 
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gttttcccag tcacgac 17 



<210> 27 
5 <211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

10 <223> Description of Artificial Sequence: Primer PstI 
<400> 27 

tcatccctgg tgttactgc 19 

15 

<210> 28 
<211> 17 
<212> DNA 

<213> Artificial Sequence 

20 

<220> 

<223> Description of Artificial Sequence: Pstll 
<400> 28 

25 catggattgg ctggccg 17 



<210> 29 
<211> 18 
30 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Prt270n 

35 

<400> 29 

tactctccag attgcctg 18 



40 <210> 30 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
45 <220> 

<223> Description of Artificial Sequence: Prtl420r 
<400> 30 

tgagatacca ctcagcag 18 

50 

<210> 31 
<211> 18 
<212> DNA 
55 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Prtl350n 
60 <400> 31 

tgcacttctc tgtctctg 18 
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<210> 32 

<211> 18 

5 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Prt2365r 

10 

<400> 32 

gacttctggc atcagttg 18 



15 <210> 33 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
20 <220> 

<223> Description of Artificial Sequence: Prt2320n 
<400> 33 

ctcatggatg gcatgatc 18 

25 

<210> 34 
<211> 33 
<212> PRT 
30 <213> Aspergillus niger 

<220> 

<221> ZN_FING 
<222> (1) . . (33) 

35 

<400> 34 

Met Thr Ala Cys His Thr Cys Arg Lys Leu Lys Thr Arg Cys Asp Leu 
15 10 ~ 15 

40 Asp Pro Arg Gly His Ala Cys Arg Arg Cys Leu Ser Leu Arg He Asp 
20 25 30 

Cys 

45 



<210> 35 
<211> 20 
50 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Degenerated 
55 Primer 137396 

<400> 35 

atgacygcyt gycayacytg 20 

60 

<210> 36 
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<211> 20 
<212> DNA 

<213> Artificial Sequence 
5 <220> 

<223> Description of Artificial Sequence: Degenerated 
Primer 137397 

<400> 36 

10 arrcancgnc grcargcrtg 20 



<210> 37 
<211> 20 
15 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Primer 13 7394 

20 

<400> 37 

atgactgcct gtcacacatg 20 



25 <210> 38 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
30 <220> 

<223> Description of Artificial Sequence: Primer 137395 
<400> 38 

agacagcgac ggcacgcatg 20 

35 

<210> 39 
<211> 21 
<212> DNA 
40 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Primer 144428 
45 <400> 39 

caccgagttt taagcttgcg g 21 



<210> 40 
50 <211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

55 <223> Description of Artificial Sequence: Primer 144429 
<400> 40 

gcgatcttga tccacgaggg 20 



60 



<210> 41 
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<211> 21 
<212> DNA 

<213> Artificial Sequence 

5 <220> 

<223> Description of Artificial 

<400> 41 

cgggatgaat tgtagagagg c 

10 

<210> 42 
<211> 25 
<212> DNA 
15 <213> Artificial Sequence 

<220> 

<223> Description of Artificial 

20 <400> 42 

cgcaagctta aaactcggtg cgatc 



<210> 43 
25 <211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

30 <223> Description of Artificial 
<400> 43 

cctcgtggat caagatcgca 

35 

<210> 44 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

40 

<220> 

<223> Description of Artificial 

<400> 44 
45 gatgaaaaga ataatcggcg ag 

<210> 45 
<211> 21 
50 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial 

55 

<400> 45 

cgcggcacac tacccccgtt g 



Sequence: Primer 153468 

21 

Sequence: Primer 140358 

25 

Sequence: Primer 140359 

20 

Sequence: Primer 175653 

22 

Sequence: Primer 175654 

21 



60 <210> 46 
<211> 22 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer B043G08 
<400> 46 

atctagctca agcattagcg gc 22 

<210> 47 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
B0403G09 

20 <400> 47 

aatttcggcc ctttagtgtc c 21 



10 



15 



<210> 48 
25 <211> 2931 
<212> DNA 

<213> Aspergillus oryzae 

<220> 
30 <221> intron 

<222> (1028) . . (1135) 

<220> 

<221> intron 
35 <222> (1538) . . (1591) 

<220> 

<221> intron 

<222> (2018) . . (2066) 

40 

<220> 

<221> intron 

<222> (2297) , . (2347) 

45 <400> 48 

gatatctcat gatctgcgtg atcggcttgc ctcctatctt agatcacccg ggcttcttca 60 
aatcagcaac aacgctcaga catgtcccct gagaggtgat ccaaatcata cacgagagaa 120 
cgcggaaacg caaattaagg atgagcgaaa aagagaaaaa aatccgttgt tcctgagtca 180 
tgacgaatga gcaaaagtca aacacacctt ctgcttttgg ggggtatgcc cgatcacaat 240 

50 cttcaacccg ccatgataag agacacacgc tatcgacaaa tcaccggagg tcaagattag 3 00 
tggcagtcct tagctaattt caggtcggcg tcaaccttag ccaacccaac ccaaccccct 360 
catggaagcg ggactcccta tggagccggc ttacatcggg cgcactgcaa tggcgcacgt 420 
caatcaaccc ctctcttgtt gcagtgccta gtatgccaaa ccaccctttc tattcttcta 480 
gaaaccacac cctagagact cggatctaca cggattggtt ggaatgctcc gattagttgg 540 

55 catttacccc aggtcaaaat ggataatcaa tctaacggag tctatttcgt caactgcctg 600 
ccagctagca caatctcctc ttcacgcccg gccgtgggct gttaaaaggg tcaattccct 660 
ccccacctgt gtggattctc tatgatttgc acgggatctg acttggtttc cacaattctt 720 
cttgctctca gcttgttcta ctcgccgatt attcttttca tcaacgcggc acactacccc 780 
cgttgtctga tgtcatgact agaactactg ttgaacctat caaatatgag gccccttcgt 840 

60 gggagcataa gagcgtgcat gtgtccgacg accacaggag aatcatcccc aatgtcggcg 900 
acgacgcgac gcgcccaaag ggccgcatta gacgttcaat gaccgcttgt aatacctgcc 960 
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gcaagcttaa 
ctttaaggtc 
tcccatgtcc 
actgtcagct 
5 caacgacagc 
agatgaccgg 
tccctccgct 
tcggtccttt 
gggagacaaa 

10 tcttagattc 
ctgcctctat 
ccataaataa 
gtactgcctg 
ccatgaacct 

15 aatacgaatc 
agttccccat 
ttgacttcct 
tgcggctctg 
actccaactt 

20 ccacagagat 
gatggcaagg 
gagcaacgta 
tgggcccatc 
tcaacagctg 

25 ctgcatcgga 
caaggatcgc 
gtggtcgatc 
ttcctcgacc 
gaccacatcg 

30 tgcaacgacc 
atgagcatgg 
agcgctctgg 
agtggcatgg 

35 



40 



aactcggtgc 
gggtgccacc 
tgccaagacc 
ccccgagacg 
tatcccctcc 
catgcttcgg 
agctcggagt 
cctacctaaa 
ccgcatccct 
taagttgtct 
agccgctaat 
tcagtcggac 
tcttctggcc 
ccaagttcga 
gctccaggca 
agatggctgg 
caatcatgtg 
gaacgctttc 
gctaatgcag 
atctcgatta 
tcgtagcaga 
tgcgattcgc 
ttcttggtaa 
gtgatgaaaa 
cagcaatgag 
gcttgatcat 
aggcgtactt 
ccctcattga 
cataccggtt 
cttgcagcgc 
aacaggtaca 
aacagctgat 
ctgtgactga 



gatcttgatc 
gttatccact 
aaataagcct 
agtgagcgct 
atcgaggagc 
cagatcttga 
gttcatacgg 
cccgttcggc 
gttgaatctc 
ctcaagttgg 
gcttgagcta 
ttccacaacg 
tcccgatatg 
cataaggcag 
ctcgcccttc 
ttactgagcg 
ccttcagagc 
tgtttaacac 
aaatagtttc 
ttgcccacga 
gatccagttg 
ggaggttgaa 
ggttaagcaa 
ttcaacattt 
gttccaagcg 
ctcgaaattc 
catcgtcggt 
ccagatccag 
ttcgtgcatg 
agtcgacggt 
attcgtgcca 
ccctgaggtc 
agccatccct 



cacgagggca 
ttgtcaaatc 
ttcttgagta 
ttcaggacag 
gtctcacttc 
atcaatcacc 
aagaaacggc 
taattcagga 
ctttcttggg 
tacagctgta 
gatttgtgga 
agatgaggaa 
tgccaggcat 
tcaatctgct 
tttgtttatg 
ggactgcaat 
ttctcattga 
agttacagta 
gctgttggca 
cttcttgagc 
tacttgatca 
tacgaagaga 
cgaggaccat 
gagcttagtc 
gagtctgaga 
ctgcaactcc 
tatgccgctc 
atgtttctgc 
atagccgagt 
tctcaatgct 
ccactagtag 
atgccacact 
gtgggatcgg 



tgcatgccgg 
tcttacgtca 
ctaatgtttc 
tactccaatg 
cctagagagg 
aagcgtctct 
ctccattgaa 
cctccaatct 
taacagtttt 
tggtcactcg 
taatttcggc 
caccgattcg 
accaccaccg 
gtgggaagaa 
gccagcggcg 
caatcatgcc 
taacgatatc 
ggtacaacat 
acgcacgtcc 
accccgctgc 
cattgcgact 
ttgaacgatg 
ctcatataaa 
tctggttctg 
gactcacgtc 
gatttgtcac 
taaatctttg 
tgcatctgtc 
tcaagcgtcg 
cgttcggaga 
atagcatgat 
catttccgga 
cgccatacta 



cggtgtctat 
aaatggggga 
tataggatcg 
tggtcagacg 
agtatgagag 
aatatctccg 
ggaaactcat 
gagttttttg 
gagaagggta 
tcatgtccat 
cctttagtgt 
ttgttatata 
attgtccata 
ccgcctttga 
ggtcaaaagg 
ctcgtctcct 
gccgctcaat 
ttccggctta 
attccattta 
aacagttgag 
ccaagccaac 
gaaagttgaa 
tgctaactat 
tcaaatcctc 
ggaaattctc 
cgctctaaga 
cgacttcaac 
gccaaacgaa 
ctgtgccgaa 
tgcccggaag 
tgggggatat 
aagtgtcata 
9 



<210> 49 
<2U> 624 
<212> PRT 

<213> Aspergillus oryzae 
<400> 49 

Met Thr Arg Thr Thr Val Glu Pro lie Lys Tyr Glu Ala Pro Ser Trp 
15 10 15 

45 Glu His Lys Ser Val His Val Ser Asp Asp His Gly Arg lie lie Pro 
20 25 " ' 30 



50 



Asn Val Gly Asp Asp Ala Thr Arg Pro Lys Gly Arg lie Arg Arg Ser 
35 40 45 

Met Thr Ala Cys Asn Thr Cys Arg Lys Leu Lys Thr Arg Cys Asp Leu 
50 55 60 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2931 



Asp Pro Arg Gly His Ala Cys Arg Arg Cys Leu Ser Leu Arg lie Asp 
55 65 70 75 80 



Cys Gin Leu Pro Glu Thr Ser Glu Arg Phe Gin Asp Ser Thr Pro Met 
85 ~ 90 95 



60 



Trp Ser Asp Ala Thr Thr Ala lie Pro Ser lie Glu Glu Arg Leu Thr 
100 105 110 
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Ser Leu Glu Arg Ser Met Arg Glu Met Thr Gly Met Leu Arg Gin lie 
115 120 125 

Leu Asn Gin Ser Pro Ser Val Ser Asn lie Ser Val Pro Pro Leu Ala 
130 135 140 

Arg Ser Val His Thr Glu Glu Thr Ala Ser lie Glu Gly Asn Ser Phe 
145 150 155 160 

Gly Pro Phe Leu Pro Lys Pro Val Arg Leu lie Gin Asp Leu Gin Ser 
165 170 175 

Glu Phe Phe Gly Glu Thr Asn Arg He Pro Val Glu Ser Pro Phe Leu 
180 185 190 

Gly Asn Ser Phe Glu Lys Gly He Leu Asp Ser Lys Leu Ser Leu Lys 
195 200 205 

Leu Val Gin Leu Phe Val Asp Asn Phe Gly Pro Leu Val Ser He Asn 
210 215 * 220 

Asn Gin Ser Asp Phe His Asn Glu Met Arg Asn Thr Asp Ser Leu Leu 
225 230 235 240 

Tyr Ser Thr Ala Cys Leu Leu Ala Ser Arg Tyr Val Pro Gly He Pro 
245 250 255 

Pro Pro He Val His Thr Met Asn Leu Gin Val Arg His Lys Ala Val 
260 265 270 

Asn Leu Leu Trp Glu Glu Pro Pro Leu Lys Tyr Glu Ser Leu Gin Ala 
275 280 285 

Leu Ala Leu Leu Cys Leu Trp Pro Ala Ala Gly Gin Lys Glu Phe Pro 
290 295 300 

lie Asp Gly Trp Leu Leu Ser Gly Thr Ala lie Asn His Ala Leu Val 
305 310 315 320 

Ser Phe Asp Phe Leu Asn His Val Pro Ser Glu Leu Leu lie Asp Asn 
325 330 335 

Asp He Ala Ala Gin Leu Arg Leu Trp Asn Ala Phe Cys Leu Thr Gin 
340 345 350 

Leu His Phe Ala Val Gly Asn Ala Arg Pro Phe His Leu Pro Gin Arg 
355 360 365 

Tyr Leu Asp Tyr Cys Pro Arg Leu Leu Glu His Pro Ala Ala Thr Val 
370 375 380 

Glu Asp Gly Lys Val Val Ala Glu He Gin Leu Tyr Leu lie Thr Leu 
385 390 395 400 

Arg Leu Gin Ala Asn Glu Gin Arg Met Arg Phe Ala Glu Val Glu Tyr 
405 410 415 

Glu Glu lie Glu Arg Trp Lys Val Glu Trp Ala His Leu Leu Ala Gly 
420 425 430 
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aggtacagaa gtccaattgc ttccgatctg 
gaagtaggta gagcgagtac ccggcgcgta 
agggcgtcca aatatcgtgc ctctcctgct 
tcaggagctg gccagcggcg cagaccggga 
5 tctgcactcg acctgctgag gtccctcagt 
gcccggtgtg tcggcggggt tgacaaggtc 
tttcctgctc tccccaccag ctgtagatct 
ccagcgcctt gtagaagcgc gtatggcttc 
cgaccaggct gcgcgttctc gcggccatag 

10 gcagcaagaa gccacggaag tccgcccgga 
tatagacggt ccccacggga tggggaaaac 
ttcgcgcgac gatatcgtct acgtacccga 
ttccgagaca atcgcgaaca tctacaccac 
ggccggggac gcggcggtgg taatgacaag 

15 cgtgaccgac gccgttctgg ctcctcatat 
gcccccggcc ctcaccctca tcttcgaccg 
ggccgcgcgg taccttatgg gcagcatgac 
catcccgccg accttgcccg gcaccaacat 
catcgaccgc ctggccaaac gccagcgccc 

20 tgcgattcgc cgcgtttacg ggctacttgc 
gtcgtggcgg gaggactggg gacagctttc 
gccccagagc aacgcgggcc cacgacccca 
gggccccgag ttgctggccc ccaacggcga 
cgtcttggcc aaacgcctcc gttccatgca 

25 cgccggctgc cgggacgccc tgctgcaact 
cacccccggc tccataccga cgatatgcga 
ggaggctaac tgaaacacgg aaggagacaa 
acttaacgtt actgaaatca tcaaacagct 
cgatgtcagc tccggagttg agacaaatgg 

30 tgtccaagca gcaaagagtg ccttctagtg 
acgcgttttc gggtttacct cttccagata 
ctgcaaccta gtaacgcctt caggctccgg 
ttcattttcg ggagacgaga tcaagcagat 
ggaatccgct cttggctcca cgcgactata 

35 cctcttcttt actctgatag cttgactatg 
agataattgc atgtttcttc cttgaactct 
tataaacctc gaaatcattc ctactaagat 
tagtgaatgc tccgtaacac ccaatacgcc 
gtcgtttacc cagaatgcac aggtacactt 

40 ctcgtgtact gtgtaagcgc ccactccaca 
ggcgtaatca tggtcatagc tgtttcctgt 
caacatacga gccggaagca taaagtgtaa 
cacattaatt gcgttgcgct cactgcccgc 
cggccgctct gcattaatga atcggccaac 

45 gctcttccgc ttcctcgctc actgactcgc 
tatcagctca ctcaaaggcg gtaatacggt 
agaacatgtg agcaaaaggc cagcaaaagg 
cgtttttcca taggctccgc ccccctgacg 
ggtggcgaaa cccgacagga ctataaagat 

50 tgcgctctcc tgttccgacc ctgccgctta 
gaagcgtggc gctttctcat agctcacgct 
gctccaagct gggctgtgtg cacgaacccc 
gtaactatcg tcttgagtcc aacccggtaa 
ctggtaacag gattagcaga gcgaggtatg 

55 ggcctaacta cggctacact agaaggacag 
ttaccttcgg aaaaagagtt ggtagctctt 
gtggtttttt tgtttgcaag cagcagatta 
ctttgatctt ttctacgggg tctgacgctc 
tggtcatgag attatcaaaa aggatcttca 

60 ttaaatcaat ctaaagtata tatgagtaaa 
gtgaggcacc tatctcagcg atctgtctat 



gtaaaagatt cacgagatag taccttctcc 720 
agctccctaa ttggcccatc cggcatctgt 780 
ttgcccggtg tatgaaaccg gaaaggccgc 840 
acacaagctg gcagtcgacc catccggtgc 900 
ccctggtagg cagctttgcc ccgtctgtcc 960 
gttgcgtcag tccaacattt gttgccatat 1020 
tggtggcgtg aaactcccgc acctcttcgg 1080 
gtaccccggc catcaacacg cgtctgcgtt 1140 
caaccgacgt acggcgttgc gccctcgccg 1200 
gcagaaaatg cccacgctac tgcgggttta 1260 
caccaccacg caactgctgg tggccctggg 1320 
gccgatgact tactggcggg tgctgggggc 1380 
acaacaccgc ctcgaccagg gtgagatatc 1440 
cgcccagata acaatgggca tgccttatgc 1500 
cgggggggag gctgggagct cacatgcccc 1560 
ccatcccatc gccgccctcc tgtgctaccc 1620 
cccccaggcc gtgctggcgt tcgtggccct 1680 
cgtgcttggg gcccttccgg aggacagaca 1740 
cggcgagcgg ctggacctgg ctatgctggc 1800 
caatacggtg cggtatctgc agtgcggcgg 1860 
ggggacggcc gtgccgcccc agggtgccga 1920 
tatcggggac acgttattta ccctgtttcg 1980 
cctgtataac gtgtttgcct gggccttgga 2040 
cgtctttatc ctggatfcacg accaatcgcc 2100 
tacctccggg atggtccaga cccacgtcac 2160 
cctggcgcgc acgtttgccc gggagatggg 2220 
taccggaagg aacccgcgct atccggatcc 2280 
tgacgaatct ggatataaga tcgttggtgt 2340 
tgttcaggat ctcgataaga tacgttcatt 2400 
atttaatagc tccatgtcaa caagaataaa 2460 
cagctcatct gcaatgcatt aatgcattga 2520 
cgaagagaag aatagcttag cagagctatt 2580 
caacggtcgt caagagacct acgagactga 2640 
tatttgtctc taattgtact ttgacatgct 2700 
aaaattccgt caccagccct gggttcgcaa 2760 
caagcctaca ggacacacat tcatcgtagg 2820 
ggtatacaat agtaaccatg catggttgcc 2880 
ggccgaaact tttttacaac tctcctatga 2940 
gtttagaggt aatccttctt tctagaagtc 3000 
tctccactcg acctgcaggc atgcaagctt 3060 
gtgaaattgt tatccgctca caattccaca 3120 
agcctggggt gcctaatgag tgagctaact 3180 
tttccagtcg ggaaacctgt cgtgccagag 3240 
gcgcggggag aggcggtttg cgtattgggc 3300 
tgcgctcggt cgttcggctg cggcgagcgg 3360 
tatccacaga atcaggggat aacgcaggaa 3420 
ccaggaaccg taaaaaggcc gcgttgctgg 3480 
agcatcacaa aaatcgacgc tcaagtcaga 3540 
accaggcgtt tccccctgga agctccctcg 3600 
ccggatacct gtccgccttt ctcccttcgg 3660 
gtaggtatct cagttcggtg taggtcgttc 3720 
ccgttcagcc cgaccgctgc gccttatccg 3780 
gacacgactt atcgccactg gcagcagcca 3840 
taggcggtgc tacagagttc ttgaagtggt 3900 
tatttggtat ctgcgctctg ctgaagccag 3960 
gatccggcaa acaaaccacc gctggtagcg 4020 
cgcgcagaaa aaaaggatct caagaagatc 4080 
agtggaacga aaactcacgt taagggattt 4140 
cctagatcct tttaaattaa aaatgaagtt 4200 
cttggtctga cagttaccaa tgcttaatca 4260 
ttcgttcatc catagttgcc tgactccccg 4320 
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tcgtgtagat aactacgata cgggagggct 
cgcgagaccc acgctcaccg gctccagatt 
ccgagcgcag aagtggtcct gcaactttat 
gggaagctag agtaagtagt tcgccagtta 
5 caggcatcgt ggtgtcacgc tcgtcgtttg 
gatcaaggcg agttacatga tcccccatgt 
ctccgatcgt tgtcagaagt aagttggccg 
tgcataattc tcttactgtc atgccatccg 
caaccaagtc attctgagaa tagtgtatgc 

10 tacgggataa taccgcgcca catagcagaa 
cttcggggcg aaaactctca aggatcttac 
ctcgtgcacc caactgatct tcagcatctfc 
aaacaggaag gcaaaatgcc gcaaaaaagg 
tcatactctt cctttttcaa tattattgaa 

15 gatacatatt tgaatgtatt tagaaaaata 
gaaaagtgcc acctgacgtc taagaaacca 
ggcgtatcac gaggcccttt cgtctcgcgc 
acatgcagct cccggagacg gtcacagctt 
cccgtcaggg cgcgtcagcg ggtgttggcg 

20 cagagcagat tgtactgaga gtgcaccata 
taggaagcag cccagtagta ggttgaggcc 
atgcaaggag atggcgccca acagtccccc 
gaaacaagcg ctcatgagcc cgaagtggcg 
gatataggcg ccagcaaccg cacctgtggc 

25 gtagaggatc tggctagcga tgaccctgct 
gaacggcgtt accagaaact cagaaggttc 
acgagagaga tgatagggtc tgcttcagta 
tattgtcgtt agaacgcggc tacaattaat 
ttaggtgaca ctata 

30 



taccatctgg ccccagtgct gcaatgatac 4380 
tatcagcaat aaaccagcca gccggaaggg 4440 
ccgcctccat ccagtctatt aattgttgcc 4500 
atagtttgcg caacgttgtt gccattgcta 4560 
gtatggcttc attcagctcc ggttcccaac 4620 
tgtgcaaaaa agcggttagc tccttcggtc 4680 
cagtgttatc actcatggtt atggcagcac 4740 
taagatgctt ttctgtgact ggtgagtact 4800 
ggcgaccgag ttgctcttgc ccggcgtcaa 4860 
ctttaaaagt gctcatcatt ggaaaacgtt 4920 
cgctgttgag atccagttcg atgtaaccca 4980 
ttactttcac cagcgtttct gggtgagcaa 5040 
gaataagggc gacacggaaa tgttgaatac 5100 
gcatttatca gggttattgt ctcatgagcg 5160 
aacaaatagg ggttccgcgc acatttcccc 5220 
ttattatcat gacattaacc tataaaaata 5280 
gtttcggtga tgacggtgaa aacctctgac 5340 
gtctgtaagc ggatgccggg agcagacaag 5400 
ggtgtcgggg ctggcttaac tatgcggcat 5460 
tcgacgctct cccttatgcg actcctgcat 5520 
gttgagcacc gccgccgcaa ggaatggtgc 5580 
ggccacgggg cctgccacca tacccacgcc 5640 
agcccgatct tccccatcgg tgatgtcggc 5700 
gccggtgatg ccggccacga tgcgtccggc 5760 
gattggttcg ctgaccattt ccggggtgcg 5820 
gtccaaccaa accgactctg acggcagttt 5880 
agccagatgc tacacaatta ggcttgtaca 5940 
acataacctt atgtatcata cacatacgat 6000 

6015 



<210> 51 
<211> 18 
<212> DNA 
35 <213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
B1042E05 

<400> 51 

cgcgcgtatc ctattgcc 18 



45 <210> 52 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
50 <400> 52 

gccggaaatg ttgtacctac 20 



<210> 53 

55 <211> 80 

<212> DNA 

<213> Artificial Sequence 



<220> 

60 <223> Description of Artificial Sequence: ICA217 
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<400> 53 

atgaccgctt gtcatacctg ccgcaagctt aaaactcggt gcgatcttga tccacgaggg 60 
catgcctgcc gccgctgcct ~ 80 

<210> 54 
<211> 80 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ICA218 



<400> 54 

atgactgctt gccacacctg ccgcaagctt aaaactcggt gcgatcttga tccacgaggg 60 
catgcctgcc gccgctgcct 80 
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